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Its completely rational. 


Microsoft’ C/C++ 70 now in- 
cludes the Windows" operating sys- 
tem version 3.1 SDK. For only $139, 
it’s hard to believe this development 
kit is so complete. We're talking all 
the latest technology programmers 
have told us they want. 

But youre still skeptical. You're 
asking yourself what is missing? 
Well, here’s an admittedly long-wind- 
ed list of what’s included. 


Object linking and embedding. 
Great for creating applications that 
share diverse types of data, like text 
and graphics. An application that 
supports OLE can cooperate with 
other OLE applications, even those 
from another vendor. 


Pen computing APIs. Everything 
you need to create compelling appli- 
cations for the next generation of 
pen computers. There are over 300 
pages of documentation that give 
you the basic elements of Windows 
for Pen Computing. 


Multimedia APIs. Our newest 
media control interface. You can in- 
corporate animation, audio and video 
capability using third party devices 
and drivers. (It’s the next best thing 
to virtual reality.) 


Windows Debugging Version. 
‘Traps the most troublesome UAEs 
and helps you create more stable, 
robust applications. Contains GDI, 
KERNEL, and USER modules. 


GUI Setup Toolkit. It will make 
hammering out a custom Windows- 
based setup program as simple as 
writing a script file. 


Microsoft Foundation Classes. 
Now you can use the same building 
blocks were using to build future 
versions of the Windows operating 
system. A rich set of 60 recyclable 
object classes provides logical order 
to more than 500 functions of the 
Windows API. Menus, GDI, OLE 
1.0 and advanced diagnostics sup- 


Offer good through 9/30/92. Reseller prices may vary. Offer good only in the 50 United States. ©1992 Microsoft Corporation. All rights reserved. Printed in the USA. Customers inside the 50 United States, call (800) 992-3675 between 6:30AM and 5:30 E 


386-Max ts a trademark of Qualitas, Inc. 


port are included. Tasks such as 
registering Windows’ classes, build- 
ing message loops, and managing 
device contexts are automatic. 


Sample code. Over 75,000 lines. 


Zoomin. A nifty tool for magni- 
fying portions of a screen to help 
identify paint problems and other 
screen-related issues. 


Heapwatker. Examine the glob- 
al heap and local heaps used by ac- 
tive applications and dynamic-link 
libraries in your system. 


Editors. And plenty of them. 
Dialog Editor lets you add, modify, 
and delete custom controls as you 
design and test dialog boxes on- 
screen. Image Editor modifies icons, 
bitmaps and cursors. Font Editor 
alters existing faces and creates 
new ones. Hotspot Editor creates 
and edits hypergraphics or bitmaps 
in one or more hotspots. 


Spy Tools. Pssst. They make 
it possible to monitor Mouse, input/ 
output from Windows, DDE and 
other messages between one or 
more Windows-based applications. 
You can also examine message 
parameter values. 


Stress. Allows you to consume 
system resources for low-resource 
stress testing. Acquirable resources 
include the global heap, user heap, 
GDI heap, disk space and file han- 
dles. (Less job stress for you.) 


Wdeb386. Now test and debug 
dynamic link libraries and Windows- 
based applications running in stand- 
ard or 386-enhanced mode. 


Multiresolution bitmap compiler. 
Combine color and monochrome 
bitmaps with different resolutions 
into a single graphic. 


Help Compiler. Create Help sys- 
tems for applications that will take 
advantage of the new Windows 3.1 
Help engine. Both context-sensitive 


and topical searches of Help files. 


Compiler features. Unrestrict- 
ed pre-compiled headers for speedy 
throughput. Explicit or automatic 
inlining of any C/C++ code for fast- 
er executions. Compressed code 
for size reductions. 


CodeView’ Debugger. This win- 


dow-oriented debugger is a power- 


ful, easy-to-use tool for analyzing 
MS-DOS’ or Windows-based pro- 
gram behavior. Test the execution 
and examine data all at the same 
time. Display any combination of 
variables — global or local— while 
you halt or trace a program’s exe- 
cution. Versions for dual and single 
monitor debugging, even in a win- 
dow, are supported. 


Source Profilers. Optimize the 
performance of all your MS-DOS- 
based or Windows-based applica- 
tions. Analyze your program to find 
code inefficiencies. 


Whew! If you were to get all of 
this separately, it would cost many 
hundreds of dollars. And definitely 
will in the very near future. 

Because the response has been 
so enthusiastic, we’ve extended 
your chance to get this upgrade for 
$139. But only for a limited time. 


~ That includes over 5,000 pages of 


C/C++ 70 documentation plus the 
entire Windows 3.1 SDK online 
documentation. We'll also throw in 
a free copy of Qualitas’ 386-Max” 
for MS-DOS-based development. 

And if you want the Windows 
3.1 SDK documentation in hard 
copy (over 9,000 pages), it’s yours 
for an additional $150. 

Look. You can defy the logic of 
thinking programmers everywhere 
and pass up this historic offer. Or 
you can call your reseller. Or just 
call us at (800) 992-3675. 
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It defies logic. 
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LARGE CHARACTER SETS FOR C 16 
by PJ. Plauger 

The internationalization of C is underway and it’s imperative that all programmers 

understand the issues surrounding localized, language-specific characters, particularly 

when it comes to wide characters and multibyte extensions as implemented in large 

character sets. 





NUMERICAL EXTENSIONS TO C 26 
by Robert Jervis 

Can C ever hope to compete with Fortran when it comes to numeric programming? 

The Numerical C Extensions Group, a subcommittee working under ANSI auspices, 

thinks so, and it may be laying the foundation for the next version of the C standard. 


MULTIPLE-PRECISION ARITHMETIC IN C 40 
by Burton S. Kaliski, Jr. 

Adding and subtracting 8- or 16-bit numbers used to be enough. But where do you 

start if you want to add, subtract, multiply, or divide 64-, 128-, or 512-bit numbers— 

particularly when using a language like C that currently supports only 32 bits? 


PERSONAL SUPERCOMPUTING: VIRTUAL MEMORY, 64-BIT 0 
by lan Hirschsohn 

Ian concludes his three-part series by discussing how virtual memory was 

implemented in PORT. Virtual memory lets you run programs that are larger than 

available RAM and call massive programs from within one another. 


INSIDE THE WINDOWS SCHEDULER 04 
by Matt Pietrek 

One of the fundamental “black boxes” in the Microsoft Windows API is the 

scheduler. Matt examines how it’s implemented and how it interacts with the rest of 

Windows. In doing so, he looks at Windows internal data structures such as the task 

database, module table, and message queue. 


MOVING FROM ASSEMBLY TO C 
by Beth Mazur 

Beth makes the case that C is increasingly becoming the language of choice for many 
embedded systems projects because it is more maintainable and portable—and just 
as efficient—as assembly language. 







HIGH-SPEED NETWORKING 86 
by William Frederick Jolitz 

Even though network systems with data transfer rates in the gigabits-per-second 

range are just around the corner, today’s software architectures won't necessarily 

scale well to higher rates. Bill examines the problems and promise of high-speed 

networking, focusing on header prediction and forward-error correction. 





Dr. Dobb’s Journal, August 1992 





ENTS 


EXAMINING ROOM 


COMPILER-SPECIFIC C EXTENSIONS 

by Al Stevens 

Borland’s C++ compiler provides C extensions to support DOS systems programming, 
that branch of programming that includes device drivers, memory-resident programs, 
and other low-level activities. Al presents a TSR driver that swaps the memory of the 
interrupted program for the TSR application’s image, executes the TSR application, 
and swaps the original program back in. 


PROGRAMMER'S WORKBENCH 


PARALLEL C EXTENSIONS 

by Barr E. Bauer 

A parallelized program is a serial program with regions of parallel execution. On 
entering a parallel region, the serial program transforms from a single process to 
multiple processes, each running one of the parallel execution threads. Barr uses 
Silicon Graphics’ IRIS Power C compiler and its parallel extensions to investigate 
these concepts. 


COLUMNS 


PROGRAMMING PARADIGMS 

by Michael Swaine 

Apple decides to get small with the announcement of its “personal digital assistant. 
But can this foray into the consumer-electronics market succeed? Michael weighs its 
chances before considering the probability of success of Apple’s other spin-offs — 
Kaleida and Taligent. 


C PROGRAMMING 

by Al Stevens 

Al puts aside D-Flat development for the moment as he celebrates his fourth 
anniversary as DDJ’s “C Programming” columnist and talks about C++ issues and 
the possibility of a pen-based D-Flat library. 


STRUCTURED PROGRAMMING 

by Jeff Duntemann 

Before trying to make sense of how Turbo Vision handles streams (which leads up to 
a revision in his mortgage calculator program), Jeff reveals that he’s changed his 
mind about software patents, thinking that maybe they’re not so bad after all. 





” 


GRAPHICS PROGRAMMING 

by Michael Abrash 

After adding illumination sources and shading to his X-Sharp 3-D graphics library last 
month, Michael provides a general-purpose color model so that you can display the 
gradations of color intensity necessary to render illuminated surfaces properly. 


PROGRAMMER’S BOOKSHELF 

by Andrew Schulman 

WEB, a programming language Donald Knuth presents in Literate Programming, lets 
you merge the executable source for a system with its description, thereby making it 
possible for software “authors” to write code in a way that makes sense to readers. 
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DEPARTMENTS 

EDITORIAL 6 
by Jonathan Erickson 

LETTERS 8 
by you 

SWAINE’S FLAMES 176 
by Michael Swaine 
PROGRAMMER’ S 
SERVICES 

OF INTEREST 170 
by Tami Zemel and 


Kathleen O’Connor 


SOURCE CODE AVAILABILITY 

As a service to our readers, all source code 
is available on a single disk and online. To 
order the disk, send $14.95 (Calif. residents 
add sales tax) to Dr. Dobb’s Journal, 411 
Borel Ave., San Mateo, CA 94402, or call 
800-356-2002 (inside Calif.) or 800-533- 
4372 (outside Calif.). Specify issue number 
and disk format. Code is also available 
through the DDJ Forum on CompuServe 
(type GO DDJ), via anonymous FTP from 
site ftp.mv.com in the /pub/ddj directory, 
and through M&T Online, a free service 
accessible via direct dial at 415-358-8857 
(8-N-1). (For M&T Online customer 
service, call 415-358-9500, extension 220.) 


NEXT MONTH 


In September, we'll examine debugging 
tools and techniques on a variety of 
platforms. 


Save Disk Space 





PAKZIP version 2.0 


PC WORLD PKWARE introduces the next generation of its award 
winning compression utility, giving even greater 
performance levels than achieved with previous 
releases of the software. PKZIP compresses and 
archives files, saving disk space and reducing file 
transfer time. 


WORLD CLASS Software developers, you can significantly reduce 

AWARD product duplication costs by decreasing the number 
of disks required to distribute your applications. Call 
for Distribution License information. 


Put Your Executables on a Diet 





Software Developers, save disk space and 
media costs with smaller executables. You 
can distribute your software in a 
compressed form with PAKLITE Professional.” 
PKLITE Professional also gives you the 
ability to compress files so that they cannot 
be expanded by PKLITE, hindering reverse 
engineering of your programs. 


PKLITE increases your valuable disk space by compressing 
executable (.EXE and .COM) files by an average of 45%. The 
operation of PKLITE is transparent, all you will notice is more 
available disk space! 


Compression for YOUR Application 


The PKWARE Data Compression 
Library allows you to incorporate 
data compression technology into 
your software applications. The 
application program controls all the 
input and output of data, allowing 
data to be compressed or extracted 
to or from any device or area of 
memory. 






Mw 
All Purpose Data Compression Algorithm compresses ASCII or Binary 
data quickly. An adjustable dictionary size allows software to be fine 
tuned for maximum compression or speed. The routines can be 
used with most popular languages, such as C, C++, Pascal, Assembly, 
Basic and Clipper. 


INC. 


9025 ™ Oeenvood Drive Grown Dect Wi 34225 
(414) 354-8699 Fax 414) 354-8559 


PAZIP 347.00 PRLITE $46.00 PAKLITE Professional 5146.00 

Please add $5.50 S&H per package in the US & Canada, $5.00 overseas. 
Wisconsin residents add 5% state sales tax & applicable county sales tax. 
Visa and Mastercard accepted, no COD orders. 
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WINDOWS AND OS/2: 
PROTOTYPE T0 DELIVERY. 
NO WAITING. 


In Windows and OS/2, you need prototypes. You have to get a sense 
for what an application is going to look like, and feel like, before you can write it. 
And you can't afford to throw the prototype away when youre done. 

With Smalltalk/V, you dont. 

Start with the prototype. There's no development system you can buy 
that lets you get a working model working faster than Smalltalk/V. 

Then, incrementally, grow the prototype into a finished application. Try 
out new ideas. Get input from your users. Make more changes. Be creative. 

Smalltalk/V gives you the freedom to experiment without risk. It’s 
made for trial. And error. You make changes, and test them, one at a time. Safely. 
You get immediate feedback when you make a change. And you can’t make 
changes that break the system. It’s that safe. 

And when youre done, whether youre writing applications for Windows 
or OS/2, youll have a standalone application that runs on both. Smalltalk/V 
code 1s portable between the Windows and the OS/2 versions. And the resulting 
application carries no runtime charges. 

So take a look at Smalltalk/V today. Call your local software dealer, or 
call Digitalk at (800) 531-2344 for 


more information. It’s time to make Smalltalk 
that prototyping time productive. 


Smalltalk/V is a registered trademark of Digitalk, Inc. Other product names are trademarks or registered 


trademarks of their respective holders. —— 
| DIGITALK 





Digitalk, Inc., 9841 Airport Blvd., Los Angeles, CA 90045 
(800) 531-2344; (310) 645-1082; Fax (310) 645-1306 





LOOK WHO'S TALKING 


HEWLETT-PACKARD NCR 
HP has developed a network trouble- NCR has an integrated test program develop- 
shooting tool called the Network Advisor. ment environment for digital, analog and 
The Network Advisor offers a comprehen- mixed mode printed circuit board testing. 


sive set of tools including an expert system, 
statistics, and protocol decodes to speed 


problem isolation. The NA user interface is Midland Bank built a Windowed Technical 


MIDLAND BANK 


Trading Environment for currency, futures 
and stock traders using Smalltalk/ V. 


built on a windowing system which allows 
multiple applications to be executed 
simultaneously. 
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KEY FEATURES 


@ World’s leading, award-winning object- 
oriented programming system 

M@ Complete prototype-to-delivery system 

@ Zero-cost runtime 


@ Simplified application delivery for 
creating standalone executable (.EXE) 
applications 


8 Code portability between Smalltalk/V 
Windows and Smalltalk/V PM 


@ Wrappers for all Windows and OS/2 
controls 


§@ Support for new CUA ’91 controls for 
OS/2, including drag and drop, booktab, 
container, value set, slider and more 


Transparent support for Dynamic Data 
Exchange (DDE) and Dynamic Link 
Library (DLL) calls 

@ Fully integrated programming environ- 
ment, including interactive debugger, 
source code browsers (all source code 
included), world’s most extensive Win- 
dows and OS/2 class libraries, tutorial 
(printed and on disk), extensive samples 


M@ Extensive developer support, including 
technical support, training, electronic 
developer forums, free user newsletter 


@ Broad base of third-party support, 


including add-on Smalltalk/V products, 
consulting services, books, user groups 














This Smalltalk/V Windows application 
captured the PC Week Shootout award — and 
it was completed in 6 hours. 





Smalltalk/V PM applications are used to 
develop state-of-the-art CUA-compliant 
applications — and they're portable to 
Smalltalk/V Windows. 














DDJ’s 1993 
Editorial 
Calendar and 
Networks — 
Neural and 
Otherwise 


the topics listed below, we'll be offering up our regular monthly fare of object-oriented 


H very year at this time, we present the editorial lineup for the coming months. In addition to 


programming tools and techniques, solutions to Windows development problems, embedded 
systems programming projects, network programming, advanced algorithms, UNIX systems, nitty- 
gritty DOS, and more. Our basic approach remains unchanged: one programmer talking to other 
programmers, sharing ideas and techniques. And there'll always be a lot of source code. With all 
this in mind, here’s what you'll being reading about in next year’s Dr. Dobb’s Journal: 


January 32-bit Programming 

February Cognitive Computing 

March Data Structures and File Formats 
April Algorithms 

May Operating Environments 
June ASM and Architectures 

July Graphics Programming 
August C/C++ Programming 
September Numeric Programming 
October Object-oriented Programming 
November Debugging and Profiling 
December Interoperability 


If you have a particular article in mind on these or other topics, or if you'd like a copy of our 
author guidelines, send us a note (DDjJ, 411 Borel Ave., San Mateo, CA 94402) or call (415-358-9500). 


Networked Systems 
We've long believed that you should write every program to be network aware, even if you don't 
initially plan on running it on a network. To underscore the importance of this, in this issue we're 
launching “Networked Systems,” a new monthly section focusing on network programming. 

In the coming months, we'll be examining all facets of software development for networked 
environments, from LANs and WANs to open (and closed) systems. Again, if you have an article 
you'd like to contribute to this section or have problems you'd like to see us address, give us a call. 


Dr. Dobb’s C++ Sourcebook 
Each year we try to give you a bonus edition of DD/ that supplements your regular monthly issues. 
This year’s special issue is entitled Dr. Dobb’s C++ Sourcebook, and if you're a subscriber, you'll 
receive it at no extra charge with your December 1992 issue. 

Included in this bonus supplement will be interviews with C++ language designers, discussions 
of why some programmers think C++ is better than Fortran for numeric programming, source code 
for some powerful class libraries, useful C++ utilities, and much more. 


Neural Net News, or Who's On First? 

Neural nets are in the news again. For starters, Bellcore has announced it’s developing a neural-net 
computer that processes 100,000 signals per second. The system eventually will be used for routing 
telephone calls, assigning radio frequencies for wireless telephones, and speech recognition. 

Caere’s soon-to-be-released Windows-based FaxMaster software uses neural nets to recognize 
and re-create characters lost or eroded during data transmission of low-resolution faxes. 

Ricoh, the Japanese company known for printers and photocopiers, has announced a high- 
speed, hardwate- only neural-network computer that will be used as an embedded controller for 
copiers. Ricoh, which is manufacturing its own 16-neuron chips, claims bragging rights to what it 
calls the first neural-net computer developed solely in hardware. 

VeriFone has unveiled a check reader that processes up to 20,000 images per second at an 
accuracy rate of 99.6 percent. The VeriFone reader is built around the Synaptics 1-1000 neural-net 
chip that’s capable of 1 billion operations per second. The PC-based reader has the 1-1000 on a 
daughterboard, plugging into VeriFone’s Motorola 68HC11-based motherboard. VeriFone and 
Synaptics claim the check reader is the first commercial application of neural-network technology. 

Hmm...this might be news to Nestor Inc. and others who for years have been developing 
neural-net software for handwriting recognition, predictive modeling, and speech recognition. I’m 
sure VeriFone and Synaptics meant to say “the first commercial hardware enim ” Tf so, Pl 
then leave it to them and Ricoh to work out who’s really on first with neural-net hardware. 


Suter 


Jonathan Erickson 
editor-in-chief 
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Maybe it’s not natural to expect 
a network of microcomputers to 


perform like a supercomputer. 


With the QNX operating system, 
it’s not natural to expect 


anything less. 


POSIX AND MORE 


Thousands of VARs and OEMs 
choose QNX for mission-critical 
applications — from POS to 
manufacturing to medical 
instrumentation. 


And with good reasons. 


Like POSIX compliance. 


And real realtime performance. 


And true microkernel architecture. 


And message-passing IPC. 
Not to mention our technical 


support and customer services. 
Now add “FLEET” to the list. 


FAULT-TOLERANT 
NETWORKING 


With most networks, 

a hardware failure spells disaster. 
But not with QNX. 

If a card or cable fails on 

a dual-net FLEET setup, QNX will 
automatically re-route data 
through the other network 

before you - or your application — 


can even blink. 


REALTIME OPERATING SYSTEM 


DISTRIBUTED PROCESSING 
WITH THE QNX® FLEET™” NETWORK. 





LOAD-BALANCING 
ON THE FLY 


For greater throughput, 
the FLEET network puts all 
available network hardware 
to work at the same time. 
And it will dynamically 
distribute the load by choosing 
the best route for the job. 


EFFICIENT PERFORMANCE 


FLEET uses network hardware 
to full advantage for maximum 
throughput. Whether you're 
running Ethernet for speed 
(application-level throughput 
at just under 1 Mbyte per second) 
or Arcnet for deterministic 
transactions — or both 
network cards in the 
same machine — you can count 


on FLEET for optimum efficiency. 


® 
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BECAUSE COMPUTERS WERE MEANT TO RUN TOGETHER. 





EXTENSIBLE 
ARCHITECTURE 


You can support new networks 
simply by adding new drivers. 
And you can start and 

stop drivers dynamically, 


without even rebooting. 


TRANSPARENT 
DISTRIBUTED 
PROCESSING 


In QNX there’s no difference 
between local execution and 
network-remote execution. 
Which means you don’t need to 
modify your applications in order 
to distribute them across 

the FLEET network. 


To sum up: networking 
with QNX is fault-tolerant, 
load-balancing, efficient, 
extensible, and transparent. 
But it’s a lot easier 

to just say “FLEET.” 


Go with the QNX FLEET network. 
Nothing runs like it. 


To find out how your 
applications can thrive in 


the QNX environment, 


call 1-800-363-9001 
(ext. 105). 
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C is the Pitts 


Dear DD, 

Reader Marty Leisner (“Letters,” April 
1992) seems to have intoxicated him- 
self on C water. I have no quarrel with 
his claim that “C lets the programmer 
do things the computer is capable of.” 
So also, with greater or lesser efficien- 
cy respectively, do assembler and Ba- 
sic, but somehow that fact slipped by 
unnoticed. I want a programming lan- 
guage that stops me from doing the 
things the computer is capable of — 
when they are the things I don’t want 
the computer to do. 

I have seen over the years perhaps a 
dozen independent benchmarks com- 
paring some aspect of C to a better lan- 
guage like Pascal or Modula-2, and | 
find it remarkable that C was beaten 
without exception. Several of the bench- 
marks were reported by persons as bi- 
ased toward C as Leisner, and these 
poor cheerleaders were reduced to a 
forlorn hope that the future would bring 
improvements to their favorite language 
sufficient to even the score. 

I think C is a wonderful programming 
language, and I hope all my competi- 
tors make full use of it. 

Tom Pittman 

Spreckels, California 


It’s the Thought that Counts 

Dear DDJ, 

Regarding Jeff Duntemann’s April “Struc- 
tured Programming” column about OOP 
and collection objects, I don’t think Jeff 
should get so depressed about the rise 
of inheritance (driven by the economics 
of code reuse) ruining the beauty and 
potential of purely encapsulated/ab- 
stracted object design. In fact, I see a 
ray of hope emanating from an idea of 
using collection classes as a basis for a 
runtime object-oriented system. 

We want a system wherein all objects 
are compatible and polymorphism is 
taken to an extreme. All objects should 
be able to respond to a given method 
invocation, either by doing something, 
doing nothing, passing the buck, or 
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complaining. We also want all of our 
data to be objects, built on other ob- 
jects solely by composition. To do this 
we have to avoid inheritance and ex- 
tend polymorphism to a composition 
paradigm. 

Another thing to work towards is a 
system that would allow us to create 
and use object classes at runtime. And 
wouldn't it be great if programs were 
object classes and an invocation of a 
program would really be an instantia- 
tion of a program object? Both an ob- 
ject’s data and methods would then al- 
so be objects. (i once worked on a 
research system where even the state 
of the processor was a data object!) 

I think one key to this system could 
be an extension of the collection-class 
concept. Essentially, a collection object 
is a set of pointers to differing objects. 
It's almost a way of performing runtime 
composition/encapsulation. However, 
the collection object and therefore its 
methods are written in stone by the col- 
lection-class definition at compile time. 

A first step might be to create a col- 
lection class with a master list of meth- 
ods applicable to its intended collection 
of members. At run time, the collection 
object could use a qualification-for- 
membership function that would inter- 
rogate nominated objects to see if they 
possessed the proper data or methods. 
Or it could be left up to the nominee 
to provide a function that could prove 
membership qualifications. One could 
then use the collection object as a run- 
time abstraction layer, providing a form 
of runtime encapsulation. 

Our collection object might allow a 
nominated member object to adopt 
some default generic methods or data 
defined in the collection class. This is 
almost a form of runtime inheritance. 
Really, the nominee would not inherit 
the method, but would allow the col- 
lector to provide a method for it. 

For those wild collections, maybe the 
nominated member and the collection 
object together would identify which 
methods on the collection object’s mas- 
ter list the nominee supports on its own, 
which of the collector’s default meth- 
ods the nominee would accept, and 
which ones it won’t deal with at all. 
This extends polymorphism to the run- 
time arena using dynamic encapsula- 
tion (or perhaps “dynamic object over- 
loading”). 

If we took this a small step further, 
the collection object could adopt the 
methods of its members. If member A 
had a method x, and member B had a 
method y, a collection object C con- 
taining A and B could have methods 
x(A) and y(B). This would really be run- 
time composition of objects. How would 





a program use an object that did not 
have defined methods at compile time? 
If our program was itself a collection 
object, it could dictate that it needs an 
x and y method from its member ob- 
jects. Or our program that used object 
C could be one that does something 
with C’s methods regardless of what the 
methods actually are, like placing them 
on a menu or running them all in some 
order. 

All this could be tracked internally to 
the collection object in much the same 
way virtual methods are tracked today, 
but allowing runtime modification of the 
virtual method tables. Windows does 
this same sort of runtime processing 
when you define farprocs and pass them 
to the Windows API. 

Somewhere we might need to dev- 
elop a communication facility so that 
objects could tell each other what their 
methods do instead of their names. 
Right now, there are only two things 
one could use to determine what a 
method does: its name and its actual 
code. Both are unsatisfactory, as names 
are ambiguous and comparing pieces 
of code is sometimes impossible (Tur- 
ing) and a bad idea (ruins abstraction). 
We need an interobject language that 
is either computed perfectly with a 
standard or one that could handle pre- 
cise translations from different corpo- 
rate dialects. 

Going way out into science fiction, 
imagine an operating-system environ- 
ment wherein everything was an ob- 
ject. Processors would run objects. 
Virtual processors would be objects. 
Peripherals would put interface objects 
into the operating-system space. Active 
operating-system collection objects 
would continuously evaluate other ob- 
jects for membership. If a new printer 
were hooked up, it would put a ser- 
vices-offered object into the system 
environment and a system printer- 
collection object would collect it, pro- 
viding a higher-level printer interface 
to other objects. 

What’s missing is that in current stat- 
ic object systems, an object class is still 
defined at compile time, limiting the 
potential for extending or modifying an 
instance of that class during run time. 
It’s really not that far from a system in 
which objects could be created dy- 
namically. The difference between an 
object-class definition and an instance 
of an object could just slowly disap- 
pear, as an object class will be itself an 
object. 

In any case, the column on collection 
objects sure got me thinking. Thanks! 

Mike Matchett 

North Little Rock, Arkansas 

(continued on page 12) 
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(continued from page 8) 

Checking Up on Checksums 

Dear DDJ, 

In the article “Fletcher's Checksum,” 
John Kodis concluded that CRC calcu- 
lation is some 20 times slower than cal- 
culating Fletcher’s checksum. I disagree. 
One does not have to shift and XOR bit 
by bit to calculate CRC. 

As pointed out by Mark Nelson in his 
article, “File Verification Using CRC” 
(May, 1992), one can “use a table look- 
up that exchanges a small increase in 
storage space for fast calculation.” 

For example, the 16-bit CRC-CCITT 
(XA164+XA12+XA5+1) can be calcu- 
lated with the Pascal code in Example 
1, using two 256-byte tables. When the 
code in Example 1 is optimized in as- 
sembly, I believe it can be just as fast 
as calculating Fletcher’s checksum. 

Lichen Wang 

Palo Alto, California 


Dear DDJ, 

John Kodis’s May 1992 article, “Fletch- 
ers Checksum,” was interesting. I like 
to fool around with CRC and PRN al- 
gorithms. My comments fall into two 
major areas: misinformation on CRC 
computational speed and weaknesses 
in the “standard” CRCs. 

There are several interesting ways to 
speed up the traditional CRC calcula- 
tions. The biggest speed-up comes from 
the most obvious design: table lookup. 
You can convince yourself that the ef- 
fect of adding another byte of data de- 
pends only on the prior state of the most 
significant eight bits of the CRC and the 
new byte of data. Any CRC generator 
can be supported, but its syndrome table 
must be precomputed. 

From my experience, any 16-bit CRC 
can be calculated in seven instructions 
per byte—not 52, as stated. Fletcher’s 
checksum, at seven instructions per 


Temp, Data, CRC_Low, CRC_High: byte; 


Table_Low: 


Table_High: array[0. 


Temp 
CRC_Low 
CRC_High 
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-255] of byte 
.255] of byte 


Data xor CRC_Low; 
Table _Low[Temp] xor CRC_High; 
Table _High[Temp] ; 


byte is still faster; just not as much as 
stated. 

Also, while I’m not usually an 80x86 
fan, note that the lowly 8088 can cal- 
culate 16-bit CRCs in only nine instruc- 
tions per 16-bit word, while the best 
68020 loop I could find requires 11 in- 
structions per word. This is because the 
68000 EOR instruction is “crippled,” and 
the 86 family can operate directly on 
AH, while the 68000 must use shift in- 
structions to access upper register bytes. 

On these machines, 16-bit CRC cal- 
culation appears to be modestly faster 
than Fletcher’s checksum. 

Mr. Kodis notes several properties 
of 16-bit CRCs. Not being the main fo- 
cus, a very short summary was pre- 
sented. One important property not 
mentioned that all standard CRCs share 
is that any block containing an odd 
number of errors is detected. This is 
accomplished by ensuring that G(x) is 
divisible by (x+2). 

Unfortunately, I know of no com- 
munication or storage method which 
tends to make mostly odd numbers of 
errors. Further, differential codes tend 
to make errors in pairs. Thus, the prop- 
erty of codes is, at best, useless. Note 
that in a channel which pairs errors, on- 
ly the 15-bit component G15=G/(x+ 1) 
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(continued from page 12) 

is actively detecting errors. This doubles 
the undetected-error rate expected with 
a prime generator. Also, for high error 
rates, some standard CRC codes de- 
generate: The undetected-error rate can 
actually climb above 2-16 (!) Cn this 
regard, CRC-ANSI is much worse than 
CRC-CCITT, for some reason.) 

Thus, for compatibility you should se- 
lect a standard CRC, preferably CRC- 
CCITT. Otherwise, you should select a 
prime generator if detection performance 
is key, especially at high-link error rates. 

Another important property of CRCs 
is that an N-bit CRC always detects 
blocks with a single burst of errors of 
length<=N. If your error-generation 
process tends to make bursts of errors, 
this fact can aid in your analysis of 
how well the resulting system will 
work with CRC. 

Peter Hanson 

Santa Clara, California 


John responds: I would like to thank 
Messrs. Wang and Hanson for the feed- 
back on my article, particularly Mr. Han- 
son’s explanation of some additional 
properties of the CRC. They both make a 
valid point: that performing CRC gener- 
ation a byte or a word at a time can pro- 


vide significant speed improvements over 
the bit-oriented approach I referred to. I 
still believe that Fletcher's technique will 
be significantly faster than the table-driv- 
en CRC approaches cited, since Fletch- 
er’s method requires fewer instructions 
and no memory-fetch time for the table- 
lookup operations. 

When selecting a data-validation 
method, a trade-off must be made be- 
tween high data integrity and high 
computational effort. The main point I 
wanted to make in my article was that 
Fletcher's technique provides designers 
with an excellent alternative to the more 
common checksum and CRC methods. 


A Programmer by Any Other Name 
Dear DDJ, 

In his April 1992 “C Programming” col- 
umn, Al Stevens insists that program- 
mers will not become obsolete. He’s 
right, but he’s also wrong. 

He cites doctors, accountants, and car 
salespeople as examples of people who 
could never write their own programs. 
All, however, will create a spreadsheet 
or request a report from a database. 
These tasks would have required a pro- 
grammer to write a formal program on- 
ly a few years ago. More likely, they 
would not have been done. 


I would argue that the doctor, ac- 
countant, or auto seller is programming. 
Not only can accountants write pro- 
grams, they do. But calling it “program- 
ming” would scare off the people who 
do it. Programming has become so easy 
it isn’t called programming any more. 

When Fortran was developed in the 
late 1950s, it was intended to eliminate 
programmers. This can be regarded as 
laughable, but in fact it succeeded— 
within the context of the time. Com- 
puting meant scientific or engineering 
number crunching. Those programs are 
still being written, almost always by a 
scientist or engineer rather than a pro- 
fessional programmer. 

Today much computing consists of 
common applications such as word pro- 
cessing or spreadsheets. Even very so- 
phisticated applications can be pur- 
chased, and the people who buy them 
work in a user department rather than 
in the information services department. 

Our idea of programming has changed 
over the last 30 years. There will prob- 
ably always be programmers—but they 
won't always do what they do now. 

Jim Caughran 

Willowdale, Ontario 
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Large Character Sets 
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The internationalization of C is underway 





PJ. Plauger 


is the only standardized programming language that 

supports large character sets. That will not always 

be true. The Japanese have made their position clear 

to ISO, the international organization that standard- 

izes programming languages. Several years ago, they 
announced their intention to veto any future language stan- 
dards that do not contain similar support. Wisely, ISO passed 
a resolution endorsing the Japanese position. 

C could have become the last standardized programming 
language that did not support large character sets. The 
Japanese were willing to exempt the C Standard because, at 
the time, it was very near completion. Many of us had al- 
ready put five or more years into standardizing C. We were 
tired and ready to quit. But we didn’t want to be the last of 
an old breed, not after all that work. Rather, we chose to do 
the extra work that made us the first of the new breed. 

To do so, we had to bend our self-imposed rules a bit. 
Standard C is highly compatible with the C of Kernighan & 
Ritchie. We resisted rather well the numerous temptations to 
“fix” the language—particularly where such fixes would re- 
quire existing C code to change. We did add a number of 
features. Some of the additions are changes to the language 
proper. Most are pure add-ons, such as new library func- 
tions. A// of the additions, however new they may appear to 
many, were based on some form of prior art. Even function 
prototypes and type qualifiers (such as cons?) were derived 
from C++ and other dialects of C. 

It was harder to find precedents for manipulating large 
character sets, at least the way we chose to do so. True, sev- 
eral companies have provided Kanji support libraries for a 
number of years. A few have permitted limited inclusion of 
Kanji characters within C source code itself. Nobody had cho- 
sen to be as ambitious as we felt we had to be. Like it or 
not, we had to be inventive. 

We were equally inventive in adding “locales.” That’s the 





PJ. Plauger, whose most recent book is The Standard C 
Library (Prentice Hall, 1992), is a member of the ISO JTCT/ 
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machinery we added to make the Europeans happy. They 
were just ahead of the Japanese in requesting that C be made 
more international. A locale summarizes many of the conven- 
tions of a given culture. Francophones want their dates spelled 
out in French. Accountants want negative numbers to print 
with a trailing “DB” instead of a leading minus. Dictionary 
writers want words to sort in a funny way. The locale machin- 
ery added to C is intended to support an open-ended set of 
such cultural conventions by defining and mixing locales. 

If you are not conversant with locales and large-character 
support in C, don’t fret. The material is so new that many 
experienced C programmers barely understand the basic con- 
cepts. More important, most C programmers don’t need to 
care. At least not yet. The push for internationalization has 
just begun, and it may be years before your corner of the 
world will feel the impact. 

But don’t feel you can ignore this topic indefinitely. The 
international marketplace for software is growing fast. Who- 
ever pays your salary will soon care very much about meet- 
ing that growing demand with an economy of effort. Stan- 
dard C offers that economy better than any other programming 
language in use today. It behooves all professional program- 
mers to understand the issues involved in this new field of 
“internationalization.” 

My focus in this article is primarily on support for large 
character sets. If you want to learn more about locales as 
well, see my book, The Standard C Library (Prentice Hall, 
1992). It discusses the entire C library, but pays particular at- 
tention to features added for internationalization. 


Representing Large Character Sets 

When Europeans talk about large character sets, they usual- 
ly mean sets with extra language-specific characters. The 95 
graphics defined in the U.S. form of ISO 646 (also known as 
ASCID are not enough. Practically every European alphabet 
defines additional characters or accented versions of the En- 
glish characters. In fact, I am told that only three languages 
in the world can get by with just the 26 letters of the com- 
mon subset of ISO 646—English, Hawaiian, and Swahili. 
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An old trick for €xpanding a character set js lO give each 
code multiple Meanings. The old Teletype Model 37, for ex- 
ample, could Print both English and Greek characters. Send 
the terminal 4 “shift out” code (SO) and it began speaking 
Greek. A “b” Printed as a beta, as J recall, Subsequent char. 
acters also printed funny until yOu sent a “shift in” code (SI). 
The termina] then reverted tO more Customary behavior: see 
Figure 1, 
dracter code this Way, 
ach code depends on 
assume, for example, that 
each sequence of begins in an “Initial shift state.” 
For our Model 37, that would be Printing English characters. 
Most characters that follow are interpreted in this context to 
determine the “metacharacter” you really mean to designate. 
Some characters simply alter the Current shift state. They spec- 
ify no Metacharacter at all, at least not by themselves. The 
Model 37 code May have (almost) doubled the number of 
characters you can represent, but it must maintain one bit of 
State information to determine each Metacharacter 
The Japanese JIS code takes this approach a Step farther, 
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INTERNATIONALIZATION 


In the initial shift state, each character defines a single 
metacharacter. ASCII is ASCII. You shift out to Kanji with 
the three-character sequence \33$B (ESC, dollar sign, cap- 
ital B). In this state, each subsequent pair of characters de- 
termines a single metacharacter. Both the first and second 
characters of a Kanji pair must be in the range [0x21, Ox7el. 
You shift in to ASCII with the three-character sequence \33(B; 
see Figure 2. 

Some simple arithmetic tells you that you can specify near- 
ly 10,000 distinct metacharacters with JIS. That’s nowhere 
near all the Kanji characters—only the more popular ones 
are included. Still, it’s worlds better than the mere 256 codes 
supported by a single 8-bit character. The price once more 
is added complexity. Parsing a JIS string takes work. It re- 
quires state memory just like the Model 37 code. And op- 
portunities abound for making malformed strings. 

It is possible to eliminate the need for state memory. The 
Japanese Shift JIS code sets aside certain character codes to 
signal the start of a two-character sequence. A character in 
the range [0x81, 0x9f] or [OxeO, Oxfc] must be followed by a 
character in the range [0x40, Oxfc]. Together, these define a 
single Kanji metacharacter. Any other first character defines 
the metacharacter all by itself. (Again, ASCII is ASCID. See 
Figure 3. 

Extended UNIX Code is a variation on the same thing. It 
was contrived to simplify the conversion of many UNIX util- 
ities to processing Kanji text. Essentially, any character with 
its sign bit set Gin the range [0x80, Oxff]) is part of a two- 
character sequence. No shift state need be retained. But you 
still need to keep track of where you are within a multiple- 
character sequence. 

(I have studiously avoided using the obvious terms “byte” 
and “multibyte” in this description. I have been equally care- 
ful to distinguish between “characters” and “metacharacters.” 
C has long had the rule that a character occupies a single 
byte. C often lives on machines where a byte consists of 8 
bits. That has led to endemic confusion between the notions 
of character, byte, and octet of bits, and the confusion will 
not soon disappear.) 

The reasons for using multibyte sequences should be ob- 
vious. We live in a world of character streams. Disks, diskettes, 
parallel ports, and serial ports all traffic in sequences of 8- 
bit bytes. To ignore this world would be foolish. A large char- 
acter set must be representable as sequences of bytes. 
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Figure 1; Model 3 7 code for printing Greek. — 





Figure 2: JIS code for printing Kanji. 





Figure 3: Shift JIS code for printing Kanji. 


Yet there are equally obvious drawbacks to multibyte se- 
quences. You can’t manipulate individual characters without 
a lot of parsing. You can’t paste strings together without care- 
ful thought about shift states (in the general case). At the 
very least, you may have to introduce many redundant shift 
sequences to be on the safe side. 


Wide Characters 

If you want to manipulate characters inside a program, it’s 
easiest if they’re all the same size. An alternate representa- 
tion for large character sets has just this property. A wide 
character is an integer large enough to represent distinct 
codes for all the characters in the set. It can be type char, 
short, int, or long. Or it can be one of the unsigned versions 
of these types. Standard C provides the type definition wcehar_t 
for the wide-character type. Include either of the headers 
<stddef.h> or <stdlib.h> to define this type. 

Just as there are several multibyte encodings for Kanji, 
there are also several wide-character encodings. The more 
popular ones are easily derived from one of the multibyte 
encodings. Essentially, you cut and paste bits from the two 
characters in the multibyte representation to make the wide- 
character code; see Figure 4. 
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Run-time compatible with WATCOM FORTRAN 77/386 


32-bit DOS support includes the DOS/4GW 32-bit DOS extender by 
Rational Systems with royalty-free runtime license 

Virtual Memory support up to 32Mb 
32-bit Windows support enables development and debugging of 
true 32-bit GUI applications and DLLs. 

Includes licensed Microsoft SDK components 


32-bit OS/2 2.0 support includes development for multiple target 

environments including OS/2 2.0, 32-bit DOS and 32-bit Windows 
Access to full OS/2 2.0 API including Presentation Manager 
Integrated with IBM Workframe/2 Environment 
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The Industry’s 
Choice. 
Autodesk, Robert Wenig, Manager, AutoCAD for Windows: 
“At Autodesk, we’re using WATCOM C/386 in the development 
of strategic new products since it gives us a competitive edge 
through early access to new technologies. We also highly 
recommend WATCOM C/386 to third party AutoCAD add-on 
(ADS and ADI) developers.” 


Fox Software, David Fulton, President: “FoxPro 2.0 itself is 
written in WATCOM C, and takes advantage of its many superior 
features. Optimizing for either speed or compactness is not 
uncommon, but to accomplish both was quite remarkable.” 


GO, Robert Carr, Vice President of Software: “After looking at the 
32-bit Intel 80x86 tools available in the industry, WATCOM C was 
the best choice. Key factors in our decision were performance, 


functionality, reliability and technical support.” 


IBM, Jobn Soyring, Director of OS/2 Software Developer Programs: 
“IBM and WATCOM are working together closely to integrate these 
compilers with the OS/2 2.0 Programmer’s Workbench.” 


Lotus, David Reed, Chief Scientist and Vice President, Pen-Based 
Applications: “In new product development we're working with 
WATCOM C because of superior code optimization, responsive 
support, and timely delivery of technologies important to us like 
p-code and support for GO Corp’s. PenPoint.” 


Novell, Nancy Woodward, VP and G.M., Development Products: 
“We searched the industry for the best 386 C compiler technology 
to incorporate with our developer toolkits. Our choice was 


WATCOM.” 
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The Leader in 32-bit Development Tools 


415 Phillip Street, Waterloo, Ontario, Canada. Telephone: (519) 886-3700, Fax: (519) 747-4971. *Price does not include freight and 
taxes where applicable. Authorized dealers may sell for less. WATCOM C and Lightning Device are trademarks of WATCOM Systems 
Inc. DOS/4G and DOS/16M are trademarks of Rational Systems Inc. Other trademarks are the properties of their respective owners. 


Copyright 1992 WATCOM Products Inc. 
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3.0 & 3.1! 


KILLS BUGS DEAD. 
KEEPS BUGS AWAY. 


SLASH DEVELOPMENT TIME 


Every year the average C programmer without MemCheck will spend one whole week 
looking for overwrites and heap corruption. Leaking memory. That’s right, one full 
week of “debugging lessons.” Maybe you're a little tired of that. 


THE 15 MINUTE SOLUTION: 


1) Include <memcheck.h> in each source module using MemCheck's auto- 
configuration utility. 

2) Place calls to start and end checking in your main() program. 

3) Recompile (first time only) and link with a MemCheck library. 


Switch MemCheck on and off from the command line at runtime, using an 
environment variable. Link with the production library, quickly removing all 
MemCheck interaction with your applications and reducing code size overhead to less 
than 1K. If you want to cleanly remove all references to MemCheck from your 
program, simply recompile with NOMEMCHECK #defined | 

“A landmark product — StratosWare is a class act” Joe Terry 


How It Works 


_ Utilizing 100% C-compatible macro interception, the MemCheck 

@ libraries act as a completely transparent code “layer” between your 

| application code and memory allocation routines. Pop-up error 
messages pinpoint overwrites, data loss, null pointer assignments, 
freeing blocks twice, freeing bad blocks, memory leakage, and other instances of 
data loss and heap corruption. No code changes required! 

“Of all the tools I've used, nothing comes close to the improvement offered by our investment 
in your product” Rob Williford 





Unlike Nu-Mega Technologies’ Bounds-Checker, which only guards against invalid 
memory accesses outside your application, MemCheck looks inside your code for the 
everyday memory allocation errors that account for most application crashes. PC 
Techniques magazine says, “The combination of Bounds-Checker and...[MemCheck] 
will provide the serious developer with a formidable arsenal.” 


Microsott Co ccccciiciccseccccsscstesesseys $139 Call now! 

Borlaad TC, 1C4+, BCs ......... $139 see | satisfaction-guaranteed offer! 
Wate Ce $179 Adda ‘smartbomb toyourdevelopmentioolarsenall 
Intel Code Builder ..1...004.5.1.04.5. $179 

Windows SDK ioc... ssccscsccosvescesese $179 TO ORDER CALL 

Borland C++ for Windows ........ $179 

Think C (Macintosh) ..............0+. $179 | - 8 0 0 - W E - D E B U G 


DOS Masters Pack 
(Microsoft C and Borland C) ..... $199 


CANADIAN AND INTERNATIONAL CALL 
Windows Guru Pack Ce SS EB ea De DG 2 9 A 4.) 
(SDK and Borland/Windows) .... $249 





STRATOSWARE CORPORATION 


All major credit cards accepted. Same day shipping on orders placed before 4PM EST. Fax orders welcome — FAX (313) 
996-2955 or (313) 747-8519. Please specify compiler make and version. MemCheck runs on IBM PCs or compatibles; 
Macintosh or compatibles; no special hardware required. MemCheck may be used with all linkers, overlay managers, DOS 
extenders and deb: . MemCheck™ is a registered trademark of StratosWare Corporation. Bounds-Checker™ is a 
trademark of Nu-Mega Technologies. Stratos Ware Corporation, 1756 Plymouth Road, Suite 1500, Ann Arbor, Michigan 
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INTERNATIONALIZATION 


(continued from page 18) 

Wide-character encodings tend to be a private matter for 
each implementation. Imagine trying to exchange data be- 
tween two different systems by shipping wide characters. 
First you must make sure that both implementations of C use 
the same number of bits to represent wchar_t. Then you 
have to worry about whether the byte orders are the same. 
In the general case, you have to transform the code values 
in some way. It’s much easier simply to read and write a 
common multibyte code. Then you don’t much care about 
the various internal forms for wide characters. 

C programmers do care somewhat about wide-character 
codes. Code value 0, for example, must be reserved for the 
null wide character. Otherwise, wide-character strings are a 
nuisance to manipulate. And you want @’ to have the same 
numeric value when converted to a wide character. In fact, 
any value you can store in an unsigned character should 
have the same numeric value when converted to a wide char- 
acter. Otherwise, all sorts of subtle but nuisancy problems 
arise. The C Standard endorses no particular wide-character 
encoding, but it does impose a few restrictions on accept- 
able code sets. 

The C Standard imposes similar constraints on multibyte 
encodings, by the way. Code value 0 always stands for the 
null character. It can never appear as part of a longer char- 
acter sequence representing some metacharacter. If the en- 
coding has shift states, then the initial shift state is somewhat 
constrained. All the basic C characters (the ones you need 
to express a C source file) stand for themselves. Put anoth- 
er way, ‘a’ stands for lowercase “a” in the initial shift state. 
It is never the first character of a longer character sequence. 
Again, these few constraints let the C programmer use proven 
techniques to manipulate even multibyte strings. (For a dis- 
cussion of the implications of wide-character on C++, see 
the accompanying text box, “So What About C++?”.) 


Extensions to C 

We added as little as possible to C to support multibyte and 
wide characters. In a C source file you can write a multibyte 
sequence in one of the following ways: 


e As part of a comment. 
e Within a “wide-character constant” such as L'x’ 
e Within a “wide-character string literal” such as L"kon ban wa". 


In the last two cases, you specify one or more wide char- 
acters in the executable code by writing multibyte sequences 
in the C source. In all cases, the multibyte sequence must 
begin and end in the initial shift state Gif shift states matter). 
It is up to each implementation to choose multibyte and 
wide-character encodings. Note that an identifier cannot in- 
clude a multibyte sequence—you're still confined to the En- 
glish alphabet for contriving names. (Several proposals are 
kicking around ISO to generalize the rules for writing iden- 
tifiers in all programming languages, however.) 

You can also write multibyte sequences in all the formats 
used by the library print and scan functions. That lets you 
intermix multibyte literal text with converted values on out- 
put. It also lets you match such text on formatted input to a 
limited degree. The problem with input comes, as usual, with 

(continued on page 24) 





Figure 4: Converting Shift JIS to wide character. 
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Paint GUI screens. 


Multiple Methodology Support. 


Work with such methodologies as De- 
Marco/ Yourdon, Gane & Sarson, Ward 
& Mellor (real-time), Entity Relation dia- 
grams, Decomposition diagrams, Object 
Oriented Design (optional), State Transi- 
tion diagrams, Flow Charts and SSADM. 


Integrated Data Dictionary/ 
Encyclopedia. 


Customize your own data dictionary / 
encyclopedia, and get multi-user sup- 
port with and without a network. 


Advanced Features. 


Take advantage of normalization, rules 
& balancing, requirements traceability, a 
network version, import/export capa- 
bilities, custom reporting, an extendable 
data dictionary, auto leveling, CRUD 
Matrices, and automated documentation 
(enhanced 2167A -- call for availability). 
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Generate schema from entity diagrams. 


A Proven CASE Product. 


IEEE Software Magazine called System 
Architect "a useful, well- 

planned, affordable POPKIN 
CASE tool." CASE Trends 
found System Architect 
"to be extremely easy to 
use... . with many 
features that are com- 
pletely lacking in higher 
priced competitors." 
Toshiba said "System 
Architect stood out from 
many other prospects." 
And System Builder 
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View data models, DF Ds, and structure charts simultaneously. 


Works under Windows 3 
or OS/2 Presentation Manager. 


You get full functionality in each 
environment. 


Affordable. 


System Architect costs from one half to 
one fifth the price of comparable CASE 
products. 


FOR THE REAL-WORLD 
CASE SOLUTION, CALL 
(212) 571-3434 TODAY.* 
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‘lo a software 
developer, this is what 
heaven looks like. 


Most people wouldn't know what to make of a screen like 
this. But developers like you know a screen like this can 
help make all kinds of applications. With OS/2° 2.0, you 
can develop the DOS, Windows” and OS/2 apps end users 
need. And you can do it faster and easier than ever 
before. Because OS/2 2.0 can make the most of your 386 
or 486 processor. 

Now you can edit in one window, compile in 
another, profile in a third and test in a fourth. Pre- 
emptive multitasking makes everything run smoothly 
and responsively. And OS/2 Crash Protection” helps 
shield running applications from each other, so if one 
goes down it won't affect the others. Instead of reboot- 
ing, you just restart the affected app and continue. 

And since OS/2 2.0 is a 32-bit operating system, pro- 
grams are easier to write and run faster. too. Which all 
adds up to improved productivity and reduced develop- 
ment cycle time. 

To keep your cycle rolling, OS/2 2.0 comes with 
HelpWare™ OS/2 Support line, Bulletin Board and IBM 
Link. All for less than the cost of DOS and Windows. 
And through September 5, you can save on [BM pro- 
gramming tools. IBM C Developers WorkSet/2 is now 
available for $295—$600 off the list ' 
price. The WorkSet/2 includes a 32-bit o\ 
C Compiler and debugger, a toolkit and \y 
IBM WorkFrame/2. For an IBM autho- 
rized dealer near you, or to order OS/2 


2.0 from IBM, cali 1 800 3.1BM-OS2% 






¢ OS/2 Crash Protection helps shield applications from each other: 
> The integrating platform of choice for DOS, Windows and OS/2. 
- Preemptive multitasking for responsive, reliable execution. 

- 32-bit flat address space Jor productive programming. 

: A full range of IBM services and support. 
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(continued from page 20) 
shift sequences. They let you specify the same sequence of 
metacharacters many different ways. But the scan functions 
still match literal text character by character. That can lead 
to all sorts of unpleasant surprises for innocent users. 

The only other addition to the C Standard is a handful of 
library functions. The header <stdlib.h> now declares the fol- 
lowing functions: 


e mblen, for determining how many characters in a multi- 
byte sequence constitute the next metacharacter. 

e mbtowc, for converting a single metacharacter from multi- 
byte to wide character. 

e wetomb, for converting a single wide character to a multi- 
byte sequence. 

e mbstowcs, for converting a null-terminated, multibyte string 
to a null-terminated, wide-character string. 

e wcestombs, for converting a null-terminated, wide-charac- 
ter string to a null-terminated, multibyte string. 


Besides the type wchar_t mentioned earlier, the library al- 
so defines two macros. These help you allocate work buffers 
for code that converts between multibyte and wide-character 
encodings: 


e MB_CUR_MAX, defined in <stdlib.h>, is the length of the 
longest permissible multibyte sequence for a single 
metacharacter in the current locale. 

e MB_LEN_MAX, defined in <stddef.h>, is the same length 
across all locales. 


Yes, an implementation can change its multibyte and wide- 
character encoding when it changes locales, at least in prin- 
ciple. Such antics are fraught with peril, however. I suspect 
that only the more ambitious implementations will permit 
such games. 


Future Additions 
All sorts of additional functions would be useful for manip- 
ulating large character sets: 


e Wide-character analogs of the <ctype.h> and <string.h> 
functions. 

e Wide-character analogs of the conversion functions in 
<stdlib.h>, such as strtod and strtol. 

e Wide-character analogs of the string I/O functions sprintf, 
usprintf, and sscanf. 

e I/O functions that convert automatically between multi- 
byte files in the outside world and wide characters inside 
the program. 


We did think about these issues when we drafted the C 
Standard. But remember we already felt that we were run- 
ning late. So we chose to include only the bare minimum of 
functionality. We figured that more extensive library support 
would emerge as people understood better how to manip- 
ulate large character sets in C. 

We figured right. The Japanese have proposed an exten- 
sive addition to the Standard C library. It includes all the 
functions outlined above. It also describes some of the sub- 
tler semantic issues in greater detail. ’'ve glossed over many 
such issues here because of space limitations. 

The ANSI C Standard was approved in 1989. ISO C fol- 
lowed in 1990. Normally, a language standard remains sta- 
ble for at least five years before it gets revisited. You’d think 
the Japanese proposal had missed the boat, but thanks to an 
accident of ISO politics, that’s not the case. For a variety of 
reasons, the ISO C committee has the charter to produce a 
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“normative addendum” to the C Standard. It wasn’t hard to 
convince the committee to include the Japanese proposal as 
part of that addendum. 

The net result is that the C Standard will likely be changed 
within the next year or so. Essentially, that change will in- 
corporate the Japanese extensions to large-character support. 
The extensions are confined to the library, and they are fair- 
ly pure. That means that existing C programs should not 
change meaning when these new functions are added. Your 
biggest worry will be whether any existing external names 
collide with the names of added functions. And that, as we 
all know, is a perennial problem with progress. 


Living with Large Character Sets 

Now you know the basics of large-character set support in 
Standard C. What should you do about it? As I mentioned at 
the outset, you probably don’t have to do much of anything 
right now. What you do in the near future depends on your 
expectations for the code you write. 

If you believe your code will never care about large char- 
acter sets, you can generally ignore them. We tried to con- 
trive the C Standard so the cost is low for those who don’t 
use large character sets. Even implementors can get off cheap. 
A C compiler for a small microprocessor can, for example, 
define wchar_t as type char. The five conversion functions 
then become trivial. The print and scan functions don’t have 
to change. Your code can stay lean and mean. 

For many applications, a wiser approach is to make it 
multibyte tolerant. Remember that a multibyte string often 
looks like any other null-terminated string. You wouldn’t sec- 
ond guess the structure of a filename in a portable program, 
would you? Then learn to be just as tolerant of text strings 
you read and write. They might one day be multibyte strings. 
If you don’t try to chop them up or paste other characters 
in the middle, they will probably survive passage through 
your code. Who knows, your application may one day start 
speaking Japanese or Arabic. 

Some applications must learn to be multibyte aware. You 
use the multibyte parsing functions religiously when manip- 
ulating strings. You probably want to adapt to the locale pre- 
ferred by each user. (My book The Standard C Library con- 
tains complete code for manipulating locales and large 
character sets with varied encodings.) You may even want 
to use arrays of wide characters for manipulating some text. 

A few applications will have to be wide-character orient- 
ed. These work exclusively with wide characters instead of 
conventional characters. They convert to and from multibyte 
characters only when communicating with the outside world. 
Such applications really benefit from the additions to Stan- 
dard C proposed by the Japanese. (I understand that Win- 
dows NT fits this description.) 

My personal belief is that conventional character strings 
will not soon go away. They meet most of our needs, even 
when dealing with large character sets. But I also see a grow- 
ing use of wide characters in the years to come. Interna- 
tionalization is a major driving force, but it is not the only 
one. Remember that large character sets have uses well be- 
yond Japanese word processors. They can also be handy for 
representing characters of different point sizes or colors in a 
typesetting package. Or they can represent musical notes of 
different pitches and durations. I leave other uses to your 
imagination. 


DDJ 
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Numerical 
Extensions to C 





An NCEG status report 


here’s a joke in the supercom- 

puting community that scien- 

tists don’t know what program- 

ming language they'll be using 

in the 21st century, but they do 
know it'll be spelled “Fortran.” If the ex- 
tensions being designed by the NCEG 
succeed, 21st century scientists may in- 
stead spell it “C.” 

The Numerical C Extensions Group 
(NCEG) began as a working group of 
members of the ANSI C committee in- 
terested in numerical programming is- 
sues, and it has been meeting regularly 
since mid-1989. In March 1991, the 
group was Officially recognized as a sub- 
committee working under ANSI with the 
designation of X3J11.1. This group will 
not produce a full standard, but instead 
will write a technical report. In the world 
of standards a technical report does not 
carry as much weight because govern- 
ment agencies will not demand confor- 
mance to it. However, the NCEG report 
will act as practical guidance to anyone 
wanting to extend C in the directions 
covered by the report. When the C stan- 
dard is next revised around 1995, the di- 
rections of the NCEG report will be im- 
portant input for the new standard. 

The group has organized its activities 
into eight topics: aliasing, array syntax, 
IEEE floating-point support, complex 
arithmetic, variably dimensioned arrays, 
exceptions and errno, aggregate initial- 
izers, and extended-integer range. 


Bob is an independent consultant, pro- 
gramming, writing, teaching C courses, 
and developing new operating-system 
and language software. He can be 
reached at bjervis/rbj@uunet.uu.net. 
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Progress on each of these topics has 
proceeded at its own pace, with wide- 
ly varying degrees of activity and inter- 
est. Some of these topics, such as [EEE 
floating point, have proceeded to the 
point that drafts of their section of the 
report are circulating among other com- 
mittees for comment. Others, like the 
recently added extended-integer ranges, 
still have fundamental issues to resolve. 


Aliasing 

An alias exists whenever you have more 
than one way to reach a piece of mem- 
ory. When a pointer contains the ad- 
dress of a static or automatic variable, 
that pointer is an alias of the variable. 
When two pointers point at the same 
memory location, they are aliases for 
the same object. 


Pointers in C have always presented 
formidable problems for optimizing 
compilers, because C code has been 
written for years with numerous alias- 
es and no way to tell the compiler 
where the aliases are present. C code 
routinely bumps a pointer through an 
array, for example, instead of using sub- 
scripting. 

When a compiler tries to optimize 
code that contains references using sev- 
eral pointers, the compiler must do one 
of two things: It must either trace the 
history of those pointers to determine 
whether they are aliases or not, or as- 
sume the worst and act as if the point- 
ers might be aliases. Most advanced op- 
timizing compilers do go to the trouble 
of tracing histories, because the bene- 
fits can be profound. 

On a PC, knowing that two pointers 
are not aliases can help the compiler 
keep a few temporaries in registers 
rather than have to recompute values. 
In a tight inner loop that might mean 
generating half as many memory refer- 
ences. On a supercomputer, however, 
knowing two pointers are not aliases 
can determine whether or not the com- 
piler can use vector instructions. In the 
same tight inner loop, using vector in- 
structions can mean more than a factor- 
of-ten improvement in speed! 

When tracing the history of a point- 
er, a C compiler runs up against a brick 
wall at the top of a function. A compil- 
er doesn’t know whether the arguments 
passed to a function are aliases. Unless 
the compiler can find all the calls to the 
function to continue tracing back (and 
I don’t know of a C compiler that even 
tries), the compiler must give up and 
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M, colleague who did not know about the 
MultiScope 386/486 feature, spent over two 
days looking for a memory trashing bug 
which was very difficult to find. I told him: 
let me show you how to find it in less than 
five minutes. We tried the 386/486 
watchpoint and indeed we found it right 
away! This paid for the product right there! 


n addition, what I like about MultiScope 
is its Windows interface, it allows me to view 
a lot of information about the program - and 
get to it quickly. 


For your real C++ debugger, please call: 


MultiScope Sales 








M ultiScope is also the ONLY professional debugger 


that offers WYSIWYG C++ debugging. You see symbols 
the way they are written in the program. MultiScope is 
definitely the best debugger for Windows! 
Peter Merrill, Computer Scientist for Adobe 
PhotoShop 

— Adobe Systems 


Real C++ debugging support including name unmangling, class browsing, C++ 
Expression Evaluation and other C++ features eWindows hosted debuggers °Full 
Support for Windows 3.1 Find out the cause of (U)AEs - fast using the MED shield and 
the Crash Analyzer Ability to post Windows Messages at run-time *Superior Message 
spying and Message Breakpoints Fast 386/486 Hardware Watchpoints Graphical 
rendering of complex data structures *Allows remote debugging over a network or a serial 
cable ¢Single or dual monitor debugging «VCR Remote Control 
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NOW YOU CAN LICENSE 
P. J. PLAUGER’S 
STANDARD C LIBRARY 


LOCALE CONTROL 


= Portable and extensible locale files 
for srk ene C code that 
adapts to various locales. 


= Provides ways to describe colla- 
tion sequences, character classifi- 
cations, and numerical format- 


g. 
Laapnnanis efficient and port- 
e monetary formatting struc- 
tures that ease internationaliza- 
tion of your applications. 


LARGE CHARACTER SETS 


=Support for large character sets, 
both as sequences of bytes (mul- 
tibyte) and as large integers (wide 
char), in accordance with the 
ANSI/ISO C Standard. 

=Intersperse conventional com- 
puter text (such as ASCII) with 
various other character sets (such 
as Kanji, Arabic, Russian, or as- 
sorted European alphabets). 

= Flexible and compact notation for 
switching among diverse charac- 
ter sets. 


MATH FUNCTIONS 


= All functions are precise, port- 
able and accurate to within two 
bits for up to 56 bits of precision. 


PROJECT MANAGEMENT 


= The economical way for augment- 
ing current C Library witha com- 
lete ANSI/ISO Standard C Li- 
rary 
= A portable way for retargeting C 
Code application to another en- 
vironment. 
= Low maintenance — license in- 
cludes one year of Plum Hall Inc. 
maintenance. 
= A convenient programming tool for 
large programming project. 
LICENSING TERMS 


= Permission for making limited 
number of copies of The Standard 
C Library (either partial or com- 
plete) for distribution. 

= Allows use of The Standard C Li- 
brary on a network 

= One year’s maintenance 
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(continued from page 20) 
assume that aliases might be present. 

Some of you may remember that the 
original ANSI C committee put a key- 
word into the language called “noalias” 
to solve this optimization problem but 
withdrew it. Since that attempt, people 
have gone back and tried to eliminate 
the major problems with noalias. The 
proposal currently being considered us- 
es a different keyword, “restrict.” The 
basic idea is that any pointer can be de- 
clared as being a restricted pointer, 
which means that when the pointer is 
created, it is not an alias of anything 
else. This is really a promise from the 
programmer. 

If you declare a pointer-function pa- 
rameter to be restricted, you are promis- 
ing that no one will call this function 
with an alias in that parameter. When 
the compiler traces back to the top of 
the function, it finds your promise and 
can conclude that no aliases are pre- 
sent. The variable in Example 1(a) is re- 
stricted. Only pointers used to modify 
memory really matter as far as aliases 
are concerned, so band cdo not need 
to be marked as restricted pointers. 

In Example 1(b), two of the five calls 
involve undefined behavior because 
some sort of overlap exists that involves 
modified values. The first call works be- 
cause it is just adding arrays x and e to- 
gether and storing them in d. The sec- 
ond call illustrates that contiguous 
portions of arrays can be used, as long 
as they don’t overlap. The third call in- 
volves adding d to itself, but shifted by 
one element. An optimizing compiler 


(a) 


might perform f5 using vector hardware 
that would produce different results, de- 
pending on the size of the vectors and 
the order in which pieces are calculat- 
ed. The fourth call fails because dis be- 
ing aliased. This particular example will 
probably get it right, but that depends 
on exactly what f5 does internally. A 
slightly modified f5 (for example with 
alil= bli]+ cli+1]}) could get the wrong 
answer. The last example works fine be- 
cause the alias is between two pointers 
that are not used to modify data. A draft 
of this proposal is now circulating among 
other standards bodies for comment. 


Array Syntax 

The subgroup examining array syntax 
has undergone a significant transfor- 
mation since the forming of NCEG. The 
original topic was focused on syntax to 
make generating code for vector proces- 
sors easier. Supercomputers like the Cray 
X/MP are designed to perform arith- 
metic on whole blocks of an array at a 
time by providing a separate floating- 
point unit for each element of a vector. 
It is a natural fit, then, for a program- 
mer to be able to write expressions in- 
volving whole arrays at a time. 

In the last couple of years, however, 
emerging interest in massively parallel 
computers has shifted the focus of the 
eroup. In a nutshell, Amdahl’s Law states 
that as you add more processors at- 
tached to a shared memory, you reach 
a point of diminishing returns where 
additional processors block each other 
from access to the memory. Massively 
parallel computers try to solve that prob- 


void f3(int n, float *restrict 4, float +b, float *c) { 


ant ts 
for (i = 6: i 


algal = oli] + elil: 
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float x [100]; 
tTG6at *C; 


i < ni itt) 


void £5(int n, float #restrict a, Float *restrict b) { 


g5(void) { 
float 
ee < 


d[108], e[100]; 


£5 (100. ..d, e); /* Behavior defined. 
£5( 50, d, dt5Q): /* Behavior defined. 
£5( 99, d¥1_ d); /* Behavior undefined. 
c=d; 
£5(100, a, 
£5(10@, 


PLUM HALL INC. 
SUITE 261, POB 111333 
KAMUELA, HI 96743 
lum@plumhall.com 
‘AX +1-808-885-3569 
PHONE +1-808-885-6663 


e); /* Behavior undefined. 
d); /* Behavior defined. */ 
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To understand how C programmers feel about 
CodeCenter, try taking away this guy's Harley. 


There’ a sure-fire way to get a C software engineer to use CodeCenter,™ formerly Saber-C. 
It’s called an evaluation. After which, most become belligerent at the mention of doing without it. 





~-  Isit CodeCenter’s lightning-fast incremental linker which provides instant feedback on 


moana changes that makes engineers so possessive? Or the exclusive automatic run-time error 
cme” = checking which catches the peskiest glitches? 

Or maybe it’s that CodeCenter is the only complete programming environment out there — with 
support for prototyping, editing, testing, debugging and maintenance. 

Perhaps it’s just that CodeCenter carves out more time for you to try new things. 
To be creative. As thousands of engineers have already discovered. 

Judge for yourself. Call 1-800-NOW-CNTR and ask for a free 
evaluation kit of CodeCenter for C or ObjectCenter™ for C. And don't 

Formerly Saber Software 


worry. No one’ going to take it away from you. They wouldn't dare. 10 Fawcett Street, Cambridge, MA 02138 « (617) 498-3000 
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(continued from page 28) 

lem by giving each separate processor 
its own memory. Advances in micro- 
processor technology have made that 
approach economically attractive. Un- 
fortunately, most existing programming 
languages like C were not designed for 
a massively parallel computer. [Editor's 
note: For more discussion on Amdabl’s 
Law, see “Personal Supercomputing: 
Seamless Portability” by Ian Hirschsohn, 
DDJ, july 1992) 

What does it mean to have a pointer 
when there may be 10,000 processors, 
each with 10,000 separate memories? 
Massively parallel computers solve com- 
plex problems by distributing arrays 
across the many memories of the ma- 
chine. Each processor solves the prob- 
lem on its local piece of the array, with- 
out competing with other processors for 
access to memory. The conventional C 


(a) 
shape [10] [1] Sa; 


doublé:Sa. a..:-be% 
£)« 4 


fiyoud ict 
iterator I = 100; 
float a[10@0], b[100]; 


atl) =bli}-# i; 


f (void) -{ 
ane “rs 
float a[100], b[10@]; 





Example 2: (a) Sample use of a 
shape; (b) this assignment can be 
performed 100 times; (c) this code is 
the equivalent of that in (b). 


float bees ate 
long double V4 


We ae hop ee ae a) 





Example 3: The value ofxty is 
computed in at least float precision 
according to the C standard. 
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approach of bumping a pointer through 
an array simply does not work efficiently 
in that environment. 

At the January 1992 NCEG meeting, 
the array-syntax group decided to focus 
on solving this problem. Several exist- 
ing dialects of C have tackled this prob- 
lem, including C* from Thinking Ma- 
chines and MPC from MasPar. The 
array-syntax group will be hammering 
out a compromise that merges the ca- 
pabilities of those dialects into a data- 
parallel C. 

The interest in this group is high and 
a special meeting was held at Thinking 
Machines in April to move the group 
rapidly forward. The group has decid- 
ed to use the C* Reference Manual as 
its base document for future work. This 
decision was difficult to achieve because 
this group has received 14 different sig- 
nificant proposals, whereas the other 
NCEG groups have received only one 
or two proposals each. 

The C* language introduces a new 
ageregate type called “shape” that has 
dimensions like an array, except that el- 
ements are not stored contiguously in 
memory and are usually distributed 
across multiple memories. The C oper- 
ators are overloaded to operate element- 
wise on objects of the same shape. Op- 
erations which are not element-wise or 
involve operands of different shapes re- 
quire the use of a new left-indexing sub- 
script operator. Example 2(a) illustrates 
the use of a shape. 

The arrays band ccan be added to- 
gether as if they were normal C vari- 
ables, but because they have been de- 
clared with a shape, corresponding 
elements of the shape will be added in- 
stead. The left indexing illustrates how 
a single element of a shape can be se- 
lected. Left indexing is rarely needed, 
however, because other operations ex- 
ist that select subsets of a shape. For ex- 
ample, the Where statement above se- 
lects only those positions of the shape 
for which the test is true, and then ex- 
ecutes the block code (incrementing a 
and copying bto co) only for those se- 
lected positions. 

There is still some controversy con- 
cerning whether parallel operations and 
data distribution can be made indepen- 
dent. An alternative proposal has been 
put forward involving a concept called 
iterators to express parallel operations. 
An “iterator” is a new kind of object that 
signals the need for “iteration” when it 
is used in an expression. Example 2(b) 
causes the assignment to be performed 
100 times. An easy way to think about 
iterators is that expressions are execut- 
ed as if a forloop were wrapped around 
the expression using the iterator vari- 
able in the loop. So Example 2(b) can 





be equivalent to Example 2(c). 

Iterators do not address the problems 
of data distribution in massively paral- 
lel computers, but they may be inte- 
grated into a larger proposal that does 
address these issues. 


IEEE Floating-point Support 

Since more and more machines are stan- 
dardizing on JEEE floating-point hard- 
ware, the floating-point subgroup has 
been working to specify how C should 
be implemented on machines support- 
ing IEEE floating point. The most im- 
portant elements of the IEEE floating- 
point standard (ANSI/IEEE 754-1985) 
not found in Standard C are the notions 
of infinity and NaN (not-a-number), 
along with the notion of the floating- 
point environment and a more complete 
computational library. 

An infinity is a value so large in mag- 
nitude that it cannot be represented in 
the range of a floating-point value, so 
it is more than just the mathematical 
concept of infinity. Any nonzero num- 
ber divided by 0 produces infinity, but 
so does a very large number divided by 
a very small number, as long as the re- 
sult overflows the floating-point range. 

A NaN is a mechanism that IEEE de- 
fined to carry information about er- 
roneous conditions that arise during 
computations. For example, trying to 
compute the logarithm of a negative 
number produces a NaN. A program- 
mer need not check hardware error flags 
or C’s errno after every step of a com- 
putation. Once a NaN has appeared in 
a computation, any further expressions 
using that NaN also produce NaN. 
Rather than having errors introduce 
numbers that might be produced by 
normal computations, a NaN is unique- 
ly identifiable as an error condition. 

A floating-point environment is a set 
of dynamic runtime flags that control 
rounding of intermediate results and re- 
porting of exceptional conditions. 

The data types used to hold inter- 
mediate results are another area ad- 
dressed by this group. The current C 
rules demand that at each operator, the 
computation occur with a precision at 
least as wide as the wider of the 
operands, but otherwise leave it up to 
the compiler. One of the alternative eval- 
uation methods being discussed, called 
“widest need,” would widen all values 
to the widest type appearing anywhere 
in the expression, not just at each op- 
erator. In Example 3, the value of x + y 
is computed in at least float precision, 
according to the C standard. Using 
widest need, this addition is computed 
using long double precision because the 
assignment (and thus the multiply and 
the additions) all need long double. 
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(continued from page 30) 

An additional area not demanded 
by the needs of IEEE arithmetic, but 
deemed to be useful in general, is over- 
loaded functions. The IEEE group de- 
fines a new header (fp.h) that contains 
a set of overloaded functions, such as 
sqrt, that use a distinct implementation 
that depends on the types of their ar- 
guments. User-supplied overloading is 
not provided, but C++’s overloading 
could accomplish most of the capabili- 
ties needed. Widest-need evaluation pre- 
sents a unique problem because it is 
supposed to affect the choice of over- 
loaded functions. In effect, C++ over- 
loading only uses the types of the func- 
tion arguments to pick which function 
to call, while widest-need evaluation us- 
es the needed return type as well. 

The IEEE subgroup has produced a 
document that is now circulating among 
other standards groups for comment in 
draft form. As much as possible, the ex- 
tensions have been defined so that any 
floating-point hardware could take some 
advantage of them. 


Complex Arithmetic 
The complex arithmetic group has pro- 
duced a specification for complex data 
types of three precisions: float complex, 
double complex, and long double com- 
plex, representing values in Cartesian, 
not polar, coordinates. 

These types are accompanied by a 
new floating-constant suffix, 7 which 
promotes the constant to have the cor- 


double complex x; 


pi — dpe ie ile 3 I 





Example 4: Assigning a value to x 
with the real part being 3.0 and the 
complex part being 5.0. 


(a) 


void f(int m, float *a) { 
jaa ae eee 


} 
(b) 
void f(int m, float alm] [m]) { 


eati)la] 


} 


(c) 


void £ tint n) t 
float a[nl; 
Je. */ 





Example 5: (a) Indexing; (b) clearer 
indexing; (c) arrays of variable 
dimension. 


a2 


responding complex type. Example 4 
assigns a value to x, with the real part 
being 3.0 and the complex part being 
5.0. This notation very closely mimics 
the notation used by mathematicians. 

The proposal also extends the nor- 
mal arithmetic operators for complex 
types and adds conversion rules for 
combining complex, floating-point, and 
integral types in expressions. 

A new header (complex.h) has been 
proposed that contains prototypes for 
complex functions like complex sines, 
cosines, exponentials, logarithms, pow- 
ers, and square roots. In addition, func- 
tions exist to extract the real and imag- 
inary parts of complex numbers, as well 
as the conjugate of a complex number. 

An issue which has not been com- 
pletely resolved is whether there should 
be a type called “imaginary.” This would 
strictly be the imaginary part of a com- 
plex number. Mathematically, an imagin- 
ary is just a complex number with a ze- 
ro real part. For C, this type would solve 
some exotic issues relating to NaNs. 


Variably Dimensioned Arrays 

The ability to specify varying array 
bounds for multidimensional arrays 
would greatly simplify the code used to 
access such array parameters. For ex- 
ample, a matrix-multiply routine that 
works on two-dimensional square ar- 
rays today must pass those arrays as 
simple pointers and explicitly perform 
the index arithmetic. Clearly, Example 
5(a) indexing would be much more ob- 
vious if it could be written as in Exam- 
ple 5(b). 

There are currently two competing 
proposals. In the “fat-pointer” proposal, 
one can declare pointers to variably di- 
mensioned arrays. These objects are 
pointers that include not only an address 
but also a descriptor for each of the di- 
mensions of the array being pointed at. 

The alternative proposal allows vari- 
able dimensions on array parameters 
to functions as a way to simplify writ- 
ing library routines for multidimen- 
sional arrays. 

Both proposals now allow automat- 
ic arrays of variable dimension for more 
flexible local storage; see Example 5(c). 

Some implementations of C provide 
a nonstandard function alloca that al- 
lows one to allocate auto storage that 
is automatically recovered on function 
return (usually done by directly ma- 
nipulating the hardware stack pointer). 
This function is not possible on some 
hardware, however, which is why it is 
not part of the Standard. Variable-length 
automatic arrays have the same prob- 
lems. The proposal that allows auto- 
matic variable-length arrays allows an 
implementation to use malloc to allo- 





cate space for the object. As a result, 
setimp and longimp may not work prop- 
erly around variable-length arrays. 


Exceptions and errno 

The specification of Standard C requires 
that errno be set when certain errors 
occur in math functions. This mecha- 
nism is not the approach that IEEE has 
advocated for signaling error conditions. 
A number of hardware implementations 
are significantly slowed down by the 
need to set errno correctly. 

Not coincidentally, the IEEE floating- 
point support <fp.h> math functions 
specifically do not have to set errno. 

At this point a specific proposal has 
not been formulated. The general feel- 
ing is that the support for errno may 
not be present in an extended math li- 
brary, which will perhaps use the IEEE 
floating-point environment instead. 


Aggregate Initializers 

Aggregate initializer extensions have 
been proposed to solve two slightly dif- 
ferent problems. First, programmers 
must be able to initialize selected ele- 
ments of an array or structure. Second, 
they must be able to specify an initial- 
izer as a constant in an expression. 

A proposal has been accepted that al- 
lows either an array subscript expres- 
sion or a structure member name to ap- 
pear in an aggregate initializer. This 
selector may appear before any indi- 
vidual value in the initializer. That val- 
ue is then assigned to the named ele- 
ment or member, and subsequent values 
are assigned to consecutive elements or 
members after that. 

Initializing selected elements of an ar- 
ray or members of a structure simplifies 
several different situations. In Example 
6(a), the div_t type is specified as part 
of the standard as having two members, 
quot and rem, for the quotient and re- 
mainder of a division, respectively. But 
which member comes first? With the 
above initializer, it doesn’t matter. 

For an array that must be initialized 
with the same elements as an enumer- 
ator, the code in Example 6(b) can be 
used. You can be confident that in the 
table, the strings will correspond to the 
correct array elements, even if you lat- 
er add more enumerators or change 
their values. 

Suppose you have an array that on- 
ly needs nonzero entries near each end 
of the array, as in Example 6(c). Sub- 
scripting and member selection can be 
combined; see Example 6(d). 

Both wand z are initialized to the 
same values. The initializers are both 
inconsistently bracketed, but in the first 
case, it is clear which elements are be- 
ing set to which values. This extension 
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(continued from page 32) 
can even be used to initialize a union; 
see Example 6(e). 

There has also been some sentiment 
for supporting initialization for ranges 
of array elements. Often, you wish to 
set all the elements of an array to some- 
thing other than 0. Several suggestions 
have been made for possible syntax, 
but nothing has yet been accepted. 
There are other extensions that may al- 
so need to express ranges of values, so 
it is likely that once some notation can 
be agreed upon for expressing ranges, 
they will be adopted wherever appro- 
priate. 

Another area that has an accepted 
proposal provides the ability to con- 
struct aggregate constants in expressions; 
see Example 6(f). In the second call, an 
ageregate constant is used to avoid cre- 
ating an extra variable. Nonconstant run- 
time values can appear in the aggregate 
constants. 


(a) 


Extended Integer Range 

Support for extended integer range is 
the newest addition to the work items 
of NCEG. Numerous hardware vendors 
either offer machines with 64-bit inte- 
gers, or are planning to introduce such 
machines. C programmers have always 
thought of char, short, and long integers 
as being 8, 16, and 32 bits wide. Stan- 
dard C requires that each of these types 
be at least that wide. These new ma- 
chines with 64-bit integers present some 
problems. If such a machine makes long 
64 bits wide, any software that assumes 
that long is exactly 32 bits wide will fail. 
Similarly, if int is made 64 bits wide, soft- 
ware that assumes int is either 16 or 32 
bits wide will fail. Probably most trouble- 
some of all, on machines that support 
64-bit integers, programmers will still 
want 8-, 16-, and 32-bit integers as well. 
C can make char, short, int, and long 
each into distinctly sized integers, but 
what happens when—some day—128- 


div_t answer = { .quot = 1, .rem 


(b) 


enum etag { Meml, Mem2, /* ... 


const char *nmi] = € [Mem2|:=°"Mem2", 


(c) 


int a[MAX] 


Ue 
i 
(d) 
Struct ft. ie 


strict: :&. willie 
struct s z[] 


(e) 


union utag { /* ... 


(f) 


struct. point {int 2)-y J. 


a A 


[Memi] = "Mem1", 
ee 


{ .any_member = 42 }; 


void drawCircle(struct point center, int radius) ; 


struct point p; 


Pee = ths pee S12; 
drawCircle(p, 15); 


drawGircle(.(etruct: point) { It, dei 2s): 





Example 6: (a) The div_t type is specified as part of the standard as having 
two members; (b) initializing an array with the same elements as an 
enumerator: (c) an array that needs nonzero entries near each end of the 
array; (ad) combining subscripting and member selection; (e) using an 
extension to initialize a union; (f) constructing aggregate constants in 


expressions. 
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bit integers appear. 

As it happens, when surveys of actu- 
al C code have been made, it turns out 
that the biggest problem is with con- 
figurations in which the size of int dif- 
fers from the size of pointers. After years 
of telling programmers that ints and 
pointers are not interchangeable, sig- 
nificant amounts of code still make that 
assumption. And although we may con- 
demn such code, we must accept that 
it exists and that customers will not be 
happy if they are forced to convert. 

Because of these difficulties, some im- 
plementations have decided to create a 
new type, “long long,” that is used for 
64-bit integers. The members of NCEG 
feel strongly that “long long” is a very 
poor design choice. The extended- 
integer range group has decided that 
despite disapproving of “long long” as 
a new integral type, they will produce 
some guidelines as part of the techni- 
cal report. These guidelines will point 
out practical portability issues involved 
in defining a “long long” type. 

Given the strains of incorporating an 
additional integer size into the C type 
system, there is a strong desire among 
group members to find a more general 
solution. The solutions suggested have 
centered around being able to declare 
the minimum size of an integer. Inte- 
gers could be specified as being some 
minimum number of bits, or else as a 
range of values. In Example 7, for in- 
stance, the declarations would make x 
a variable that is a signed integer at least 
32 bits wide. The type of y is some in- 
teger large enough to hold the values 
from 1 through 100. This capability is 
similar to the ability to define integer 
ranges in Pascal. 

While the work is still very prelimi- 
nary, the type system of C is not likely 
to be as strict as that for Pascal. Integer 
ranges in C would simply be synonyms 
for one of the supported integral types. 
Significant work is still needed on con- 
version rules, along with the exact 
meaning of these declarations. 

One question still unanswered is 
whether a programmer will only be able 
to specify types that are at least as wide 
as desired. Some wish to specify that 
they want an integer exactly N bits wide. 
This is analogous to the existing bit- 
fields of C, except perhaps allowing ar- 
rays of these new integer types. 


Conclusion 
There has been a subtle shift in the 
standards-making process since the be- 
ginning of the ANSI C committee in 
1983. At that time, a standards com- 
mittee was assembled after a language 
had been in use for some time and a 
(continued on page 39) 
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code compiler for Prolog now 
supports Windows, with direct ] 
access to the entire Windows API, | 
including all the new 3.1 features 
and supplementary DLLs. Easier | 
than ever to interface to other 
languages, with ready-to-use 
bindings to Paradox Engine, 
QELIB, Oracle and XVT. Includes 
special version of SLR’s Optlink/ 
Windows. 


List: $599 PS Price: $589 
FastFaxts: 2135-001 
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SlickEdit by MicroEdge 


Powerful, programmable editor 

emulates BRIEF and Epsilon. Includes 
macro language, undo/redo up to : 
32,000 steps, multiple windows, on-line 
help, multiple clipboards, compiler error | 
message processing, procedure q 
tagging, programmable file manager, 
and command retrieval. 


DOS/OS/2: 

List: $ 195 PS Price: $189 
FastFaxts: 1101-001 

Windows NT 386 or mips List: $195 
—call Micro Edge for more information 
Unix/Xenix single user: 

List: $295 PS Price: $279 
(cpu license) for Unix/Xenix, Sun OS, 
AIX RS6000, Motorola 88k, DG Aviion, 
Silicon Graphics, or HP9000: 

List: $425 PS Price: $415 


FastFaxts: 1101-005 j 
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Q+E Database Library 
(QELIB) 


by Pioneer Software 
Don’t rewrite code just to use a 
different database! QELIB and 
your favorite development tool 
can build applications that 
simultaneously and identically 
access, query, and edit data in 
fourteen formats: DB2, Oracle, 
SQL Server, Sybase, Netware 
SQL, Ingres, XDB, SQL Base, 
Btrieve, OS/2 EE DBM, Paradox, 
dBASE, Excel .XLS files, and 
text files. 


List: $399 PS Price: $379 
FastF axts: 2137-012 


High C/C++ 
by MetaWare Inc. 


MetaWare Incorporated announces its 
newest product. The 32-bit High C/ 
C++ compiler. High C++ is a true 
compiler, not a C to C++ translator. 
“Incremental Strengths” lets you 
specify the level of C++ compilation, 
allowing you to migrate from C to C++ 
one C++ block at a time. Included in 
the package is a C++ tailored source- 
level debugger, and a 32-bit Windows 
Application Development Kit. 
MetaWare offers a full line of multi- 
language, multi-platform compilers for 
professional software developers. 


List: $795 PS Price: $749 
FastFaxts: 0089-038 
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Smartcom II® for the 
Macintosh® 


by Hayes Microcomputer 
Smartcom II® for the Macintosh® 
Praised by software reviewers and 
users as the most reliable, easiest to 
use Mac communications software 
available. Smartcom II supports 7 
System 7, XMODEM, YMODEM and if 
ZMODEM, and all popular modems 
plus the advanced features of Hayes 
Smartmodem and V-series products. 
Other features include support for 
MultiFinder®, Autopilot sequences to 
automate communications, and 
Interactive Graphics. 


LIST: $149 PS Price: $109 
FastFaxts 1952-025 


DoDOT 4 


by Halcyon Software 


DoDOT version 4 is the complete 
graphics toolbox for Windows 
users. Graphics file conversion/ 
viewing/printing, screen capture, 
bitmap-to-vector, vector-to-bitmap 
(Trace), color image processing 
and editing, scanner input, FAX 
output, and many more, all 
integrated into one easy-to-use 
environment. It also catalogs 
graphics files with preview images 
(Thumbnails) for quick access. 
Supports over 50 vector (Draw) 
and bitmap (Paint) file formats. 


List: $189 PS Price: $175 
FastFaxts: 2942-001 


























































































The SoftC Database Mie anaabearinneee 
° “ y Blue Sky Software 
Librar | WindowsMAKER Professional is | 
by SoftC o considered the AE Rare ou Visual @ 
: : te aug Prototyper and C/C++ Code Generator @ 
me nC e the sneaks oe of ; i , for Windgws. New architecture uses : 
choice tor protessiona ++ & : -, Switch-It Code Generation Modules for/ 
Visual Basic developers. It inte a. generating ANSI C, MFC C++, OWL : 
provides fully compatible, multi- mm VY C++ code, among others. Design | 
user access to FoxPro 2, dBASE 3 Vedi Poone Lege eee aie : 
IV, and Clipper data, index&  §$(@ | | \]]siiN | box editor let you test the look & feel | 
memo files. The library source is i and make changes on the fly. 
written in 100% C for portability a q _ TrueCode technology ensures that user 
(DOS, Windows, OS/2, and Bm 3, — Togeneration, Generates the Windows 
UNIX). Executables are small, | fe .EXE with fully commented C or C++ 
fast & reliable. Full source code source code and production files. CUA @ 
and Windows DLL included. : ig 4 & SAA sear Incl. ee q 
. . a No royalties. Highly recommended. 
List: $195 PS Price: $179 q _ List: $095 PS Price: $895 
FastFaxts: 5306-001 . | . __ FastFaxts: 2001-014 
WATCOM C9.0/386 ee 
by WATCOM y stratos W are Corp. 
Far and away the best debugger 





Develop and debug 32-bit 
applications for extended DOS, 
Windows and OS/2 2.0. Includes 
royalty-free 32-bit DOS extender, 
true 32-bit Windows GUI 
Application Kit, our fast, tight, and 
reliable 32-bit Code Optimizer, 
licensed Microsoft Windows SDK 
Components, an interactive 
Source-Level Debugger, an 
Execution Profiler and More! Now 


in its class, MemCheck offers a 
complete solution to memory 
allocation errors. Pop-up 
messages pinpoint overwrites, 
memory leakage, null pointer 
assignments, invalid frees, and 
other serious errors, complete 
with line and file information. 
Configure MemCheck on any 
project in 15 minutes or less with 
no changes to your source code. 





includes OS/2 2.0 support. Please specify compiler when 
LIST: $895 PS Price: $739 ordering. 
FastFaxts 1044-029 List: $140 PS Price: $125 


FastFaxts: 3959-001 
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Your Data 
Takes the strugge out of data conversion! Get your data where you wnat 
it — how you want it — without programming! by Tools & Techniques 


@ Convert to & from scores of databases, spreadsheets and : 
—any ASCIl (fixed, delimited, report, downloads), 
—Binary/EBCDIC/mainframe files with packed decimal, floats, etc. 
—Btrieve, Netware SQL, c-tree,C-ISAM & COBOL files, 
—Accounting apps (Platinum, DAC, Solomon & many more), and 
—Stat packages (SAS, S-Plus, SPSS). 

@ Convert & clean up mailing lists or any structured data file. 

@ Select fields/records; split/merge fields; search/replace 

m@ Parse Name, Address, Phone# and ZIP fields automatically 

@ Completely menu driven! Use script files for easy automation! 

"...€@ powerhouse among data conversion packages.” Lotus Magazine 

Professional List: $ 199 PS Price: $175 

Advanced List: $299 PS Price: $269 

FastFaxts: 1377-001 


BLINKER 2.0 
by Blink, Inc. 


Fastest dynamic overlay linker for C, 
C++, ASM, BASIC, Clipper, 
QuickBASIC, Fortran, etc., with new 
integrated memory swap function 
designed to save time and memory. 
Automatically creates fast, stable 
overlaid .EXEs to reduce memory 
requirements. Uses XMS/EMS to 
save currently executing program 
and run a second within the first. 
Features CodeView support and 
overlay caching to XMS/EMS for 
optimum runtime performance. 


PS Price:$279 


WordPerfect 
by WordPerfect 


WordPerfect gives you text/ 
graphics integration, tables, pull- 
down menus, mouse support, 
equation editor, spreadsheet links, 
labels, mail merge, context- 
sensitive help,spell checker/ 
thesaurus, macros, dictionary- 
based hyphenation — plus toll-free 
support. 


LIST: $495 PS Price: $279 
FastFaxts 1933-014 


PRODUCT 

Norton Desktop for Windows 
Lotus 1-2-3 v3.1 

Excel 4.0 

QuattroPro 

WordPerfect 

WordPerfect (Windows) 
Microsoft Word 


PARADE OF Microsoft Word (Windows) 


MS DOS 5.0 upgrade 
PRODUCTS | Borland C++ 

Microsoft C/C++ 7.0 

Clipper 

dBASE IV 1.5 

FoxPro 2.0 

Windows 3.1 


What is FastFaxts? : 
Get literature on any of our more than 10,000 products via 
fax machine 24 hours a day, 7 days a week. FREE! 

Call: 617-740-0025 from any fax machine! 

Follow the voice computer’s instructions and enter your 
product’s code number. Product literature prints out of your 
fax machine instant 
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Janus/Ada x86 Compiler 
by R. R. Software 


Janus/Ada for DOS: the power 
and reliability of Ada, priced for 
everybody! You get a full 
implementation of the Ada 
standard — compiler, linker, 
royalty-free runtime libraries, 
multiple memory models, 
environment, tools, even an Ada 
tutorial. And R.R. Software 
supports you with 11 years of 
Ada experience and know-how. 
Now's the time for you to use 
Janus/Ada. 


LIST: $129 PS Price:$99 
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InstallBoss 3.5 
by Kedwell Software 


Installation for program generator 
for MS-DOS based computers. It 
generates professional installation 
programs and procedures for your 
software application. InstallBoss is } 
menu-driven and uses a 
compression utility, and checks | 
Config.sys and Autoexec.bat files. | 
Prepares distribution diskettes for | 
easy installation. 


LIST: $170 PS Price: $149 
FastFaxts 5277-001 


by Wordtech 


ARAGO dBXL is a fully dBASE IV 
compatible database management 
system with unmatched performance 

and ease of use. It is exceedingly fast 
and features an elegant CUA compliant 
user interface with extensive HELP 
facilities, aTest Coverage Analyzer,a_ | 
Panel Painter and numerous other tools | 
for both first-time users and advanced | 
developers. ARAGO dBXL is also 
compatible with dBASE III+ which will 
allow you to get started immediately i 
running millions of existing applications! | 
Requirements: DOS 3.1 or higher, IBM § 
PC or better, hard disk, 640K RAM. 


LIST: $699 PS Price: $599 


FastFaxts 971-039 


PRODUCT 

Windows 3.1 upgrade 
Norton Utilities 

Visual Basic 1.0 
Carbon Copy Plus 6.0 
ProComm Plus 

xtree gold 2.5 


Norton PC Anywhere 4.5 


Stacker 
BTrieve 
-.RTLINK Plus 5.1 
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ObjectVision 2.0 
QEMM 
DesqView 

ACT! 
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NCEG 


int: 32 A 
100} y; 


Int: 1... 





Example 7: The declarations make x 
a variable that is a signed integer at 
least 32 bits wide. Type y is some 
integer large enough to hold the 
values from 1 through 100. 


(continued from page 34) 

de facto standard had been determined 
by the marketplace. The standards com- 
mittee was charged with ironing out the 
differences between implementations 
and writing a clear specification for all 
the dark corners. 

Nowadays, standards committees are 
being used to create standards from 
scratch. Companies are using these neu- 
tral forums as a way to arrive at sensi- 
ble solutions without having to risk years 
of possibly expensive conflict between 
competing products. The Beta/VHS bat- 
tle is an obvious example from the con- 
sumer-electronics industry of the costs 
of letting the market pick winners. 

The NCEG group is part of the new 
spirit in standards. Competing vendors 
can come together and share their ex- 
pertise before they invest large sums of 
money in building C compilers and be- 
fore customers invest huge sums of mon- 
ey in competing designs. The custom- 
ers are the ultimate winners. They can 
buy a C compiler that supports an NCEG 
extension and feel confident that they 
can port that code to another vendor. 

Will C, with the addition of the NCEG 
extensions, replace Fortran? The NCEG 
committee does not see this as the goal 
of the group, nor is it likely to be ac- 
complished. Programmers continue to 
use Fortran for reasons that have noth- 
ing to do with whether it is the best lan- 
guage in the world. Decades worth of 
code have built up in Fortran that no 
one wants to convert to C. Programmers 
with decades of experience aren’t will- 
ing to change languages easily. 

What will emerge from the NCEG ex- 
tensions is a better C for those who want 
to use it for numerical programming. 
The data-parallel extensions promise the 
possibility that C will become one of 
the first truly portable languages avail- 
able for massively parallel supercom- 
puters. If that happens, C will be lead- 
ing the way into the 21st century and 
the brave new world. 
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ImageMan/X - the ult 
export tools. With ImageMans. 
application can export TIFF, P 

GIF, TARGA, and BMP files wit _ 
unbelievable ease. You have complete : 
control over all aspects of file creation, — 
including compression type and color a : 
makeup. : 


$3 18995. 


ImageMan tn an easy to use 
Visual Basic control. You 
can display and print 
images with the same 
amazing speed of the 
original ImageMan, plus gain built-in 
support for panning, zooming, and 
other image manipulation functions. 


$249 


For the very best in imaging tools, call 
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It's true! You can write your 
application just once in C or 
C++, and link it with an XVT 
library to get a native 
application for the GUI system 
of your choice. Not only have 
we been shipping XVT for 
over 3 years, but our 
customers are shipping their 
applications, too! Call today 
for full details, including a 
free, twenty-page XVT 
Technical Overview. 


XVI Software, Inc. 

Box 18750, Boulder, Co. 
80308 USA 

303 443-4223 

FAX 303 443-0969 





SOFTWARE INC 


AV is a trademark of XVT Software, Inc. 
Motif, OPEN LOOK, Macintosh, MS-Windows, 
and OS/2-PM are trademarks of their vendors. 





You can 
develop 
portable 


GUI 
applications 


with XVT! 


for: 
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omputer arithmetic gets better 
and faster all the time. Once you 
could only add 8-bit numbers, 
then 16 bits became the stan- 
dard, now 32 bits, soon 64.... 

But what if you want to add, subtract, 
multiply, or divide 512-bit numbers? Few 
computers have machine-language in- 
structions for big-number arithmetic and, 
for obvious reasons, even fewer pro- 
gramming languages have built-in op- 
erations supporting it. 

The ability to manipulate 512-bit (or 
larger) numbers is necessary, for in- 
stance, when you're implementing math- 
ematically oriented encryption schemes; 
both the RSA and DSS schemes involve 
such numbers. (For more-detailed in- 
formation on RSA and DSS, see the side- 
bar entitled, “Public-Key Cryptography 


Meets the Real World” on page 22 of | 


the May 1992 DDJ.) So the question aris- 
es, how can you operate on 512-bit 
numbers in a language such as C that 
only goes as far as 32 bits? 

For the answer, we can look at one 
simple implementation of what is called 
“multiple-precision” (MP) arithmetic. MP 
arithmetic has been implemented in 
RSA Laboratories’ cryptographic tool- 
kit, RSAREF, and the code has been port- 
ed to many machines without modifi- 


Burt is chief scientist of RSA Laborato- 
ries, a division of RSA Data Security. He 
received a PhD in computer science from 
MIT in 1988 and is interested in cryp- 
tography and fast arithmetic techniques. 
He can be contacted at burt@rsa.com 
and at 10 Twin Dolphin Drive, Redwood 
City, CA 94005. 
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Multiple-precision 


Arithmetic in C 


Big jobs require big numbers 


Burton S. Kaliski, Jr. 


cation. (RSAREF is available to U.S. and 
Canadian citizens at no charge for non- 
commercial use; contact RSA Laborato- 
ries, 10 Twin Dolphin Drive, Redwood 
City, CA 94065 or at rsaref @rsa.com.) 


Representing MPs 

MPs are represented as arrays of type 
NN_DIGIT, where NN_DIGIT depends 
on the machine. For instance, NN_DIG- 
IT might be defined as an unsigned 
long, which has 32 bits on most ma- 
chines. For technical reasons, the num- 
ber of bits b must be even. Each ele- 
ment of the array is a digit in the base 
B representation of the MP, where B=2?. 
The minimum digit is 0 and the maxi- 
mum digit is B-1. 











Lower-indexed elements of the array 
are less significant than higher-indexed 
elements. We call a/O/the 1s digit of ar- 
ray a, all] the Bs digit (similar to 10s 
digit), a/2/the B?s digit (similar to 100s 
digit), and so on. 

For example, the ninth Fermat num- 
ber, 2?!4+1, would be represented as an 
array of 17 32-bit NN_DIGITS: 


Listing One (page 116) defines NN_ 
DIGIT and some other items helpful in 
handling MPs, including function pro- 


totypes. 


MP Tools 

We start with four tools: C’s built-in ad- 
dition, subtraction, multiplication, and 
division operators. The tools let us: 


e Add two NN_DIGITs, and get the 1s 
digit of the sum (but not the carry-out). 

e Subtract an NN_DIGIT from an NN_ 
DIGIT, and get the 1s digit of the re- 
mainder (but not the borrow-out). 

e Multiply two NN_DIGITs, and get the 
1s digit of the product (but not the Bs 
digit). 

e Divide an NN_DIGIT by an NN_DIG- 
IT, and get the quotient, also an NN_ 
DIGIT. 


MP operates on top of these tools. 
Adding and subtracting MPs is pretty 
easy; multiplying them is harder, and 
dividing is hardest. We tackle the prob- 
lems in that order. We stop along the 
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Powerful New ‘Tools for OS/2 Programming 


WorkFrame/2 software ... because the best 
environment for application development is the 
one you create yourself. 


With WorkFrame/2 software, you can integrate your choice of 
development tools — including DOS and Windows tools. It’s 
open, configurable and language independent. And it’s easy to 
customize the WorkFrame/2 interface to create your own 
development environment. 


The concept is simple. The WorkFrame/2 program organizes 
files into logical units called projects. By associating each 
project with your personal choice of compiler/debugger/ 
linker/editor you can get the greatest productivity possible 
from all your development tools. 


C Set/2 program ... because application 
development should be fast - and simple! 


C Set/2 program delivers a one-two punch to help you 
create some of the highest-performing OS/2-based 
applications possible. 


Firstly, the 32-bit C compiler enables your applications to 
exploit the speed and power of 386- and 486-based 
computers. It’s the best high-performance code optimizer in 
the business. With the C Set/2 compiler, unsafe optimizations 
simply don't exist. 


Secondly, the C Set/2 program comes with a fully interactive, 
full-function, source-level 32-bit Presentation Manager 
debugger. Just point your mouse and shoot, using the 
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IBM, C Set/2, OS/2, WorkFrame/2 and Presentation Manager are trademarks of International Business Machines Corporation. Windows is a trademark of Microsoft Corporation. 





graphical PM user interface — or use the keyboard. kither way 
it’s easy. And you'll get instant feedback on the screen to 
verity what you're doing. Debugging has never been so simple! 


And there’s more. C Set/2 compiler conforms to industry 
standards - including ANSI C and ISO/IEC — and offers 
Microsoft C compatibility. With features like a full suite of 
run-time libraries and 32-16 bit linkage, you can be sure 

C Set/2 program will provide the function and flexibility you 
need to make application development fast and simple - the 


way it should be. 


OS/2 2.0 Developer’s Toolkit... 
because it takes the right set of tools 
to build powerful applications. 


OS/2 2.0 Developer’s Toolkit is the perfect companion to 
use with C Set/2 programs. It contains a variety of language- 
independent application build and productivity tools. For 

C Set/2 compiler, the Toolkit provides the system linker 
and system header files. It also contains the import libraries 
and the NMAKE utility you need to dramatically increase 
the capabilities of the C Set/2 compiler to build powerful 
applications. | 


For more information on how IBM application development 
tools can work in your OS/2 environment. 


call 1-800-342-6672. 


IBM ... Making good things happen 


in application development. 








(continued from page 40) 
way to add tools for multiplying and di- 
viding NN_DIGITs. 


First Step: MP Addition 

In grade school you were taught that to 
add two numbers, you wrote them 
down, one above the other, then added 
columns of digits from right to left. You 
wrote down the 1s digit of the sum at 
the bottom of the column. If the sum 
had a 10s digit, you wrote it down at 
the top of the column to the left. This 
was the “carry,” and the last carry be- 
came the leftmost sum digit. 

Adding two MPs is much the same: 
We have a carry-in that’s either 0 or 1 
and two addend digits; we want a sum 
digit and a carry-out. But it’s harder to 
get the carry-out than in the grade- 
school method because our tools only 
let us see the 1s digit of a sum. So let’s 
take it a step at a time. 

We first add the carry-in to the first 
addend digit, and look at the 1s digit of 
the sum. Let’s call this the “subsum”. 
Here’s where we do a “twist” on the 
grade-school addition. We don’t know 
directly whether there is a carry. But we 
do know that if there is, then the sub- 
sum must be less than the carry-in, be- 
cause the real sum has wrapped past 
the maximum digit—but it can’t get as 
far as the carry-in. (We detect carries 
this way throughout our MP imple- 
mentation.) 

So if the subsum is less than the car- 
ry-in, we have to carry out. We also 
know that the carry-in is 1, not 0 (oth- 
erwise we couldn’t have carried out); 
and the subsum is 0 (since it’s less than 
the carry-in). Since the subsum is 0, we 
write down the second addend digit as 
the sum digit, and go on to the next 
digit. 

If we don’t carry out right away, we 
next add the subsum to the second ad- 
dend digit and look at the 1s digit of 
the sum, which we write down as the 
sum digit. If the sum digit is less than 
the second addend digit, we have to 
carry out. Otherwise, we don’t. Figure 
1 gives an example of this twist on 
grade-school addition. The first row 
(1s and Os) are the carries; the third is 
the subsum. In the 1s column, the sub- 
sum is more than the carry, and the 
sum digit is more than the second ad- 
dend digit, so there’s no carry out. In 


11100 ¢carry. 
9876 €< first addend 


0976< subsum 
5 43 2 — second addend 
15308< sum 





Figure 1: Grade-school addition with 
a twist. 
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the 10s and 100s column, the subsum 
is more than the carry, and the sum 
digit is less than the second addend 
digit, so there is a carry out. In the 
1000s column, the subsum is less than 
the carry, so there is a carry out. The 
twist is that we can do everything with 
an addition operation that gives only 
the 1s digit of its result. 


How can you 
operate on 512-bit 
numbers in a 
language such as C 
that only goes as far 
as 32 bits? 


The procedure NN_Add (Listing Two, 
page 116) adds MPs with our limited 
tools. The addends are b and c, and the 
sum is d. They all have length digits. In- 
dex variable i moves through the digits 
from right to left, with carry as the car- 
ry-in and -out. The procedure returns the 
final carry-out. 

NN_Add adds MPs with a loop 
around a simple three-part conditional. 
The first part computes the subsum. If 
there’s a carry out, the first part sets the 
sum digit to the second addend digit, 
and stops. The second part computes 
the sum digit. If there’s a carry-out, the 
second part sets carry to 1. The third 
part sets carry to 0 when there’s no car- 
ry-out. 

The sum can share memory with the 
addends, since NN_Add stores inter- 
mediate results in the variable ai. (The 
sum must share exactly the same mem- 
ory; if it overlaps only partially, the re- 
sult is undefined.) All procedures de- 
scribed here have the shared-memory 
feature. 


MP Subtraction 

Subtracting two MPs is just like adding 
two MPs, except that we borrow instead 
of carry. Here, we have a borrow-in 
that’s either 0 or 1, a subtrahend digit, 
and a minuend digit (how’s that for ter- 
minology!); we want a remainder digit 
and a borrow-out. 

We first subtract the borrow-in from 
the subtrahend digit and look at the 1s 
digit of the remainder. We call this the 
“subremainder.” If the subremainder is 
more than the maximum digit minus the 
borrow-in, we have to borrow out. We 
also know that the borrow-in is 1, and 
the subsum is the maximum digit. (This 


is just like addition, except everything’s 
reversed: “Less” is “more,” “plus” is “mi- 
nus,” “0” is “maximum digit.”) 

Since the subremainder is the maxi- 
mum digit, we write down the maxi- 
mum digit minus the minuend digit as 
the remainder digit and go on to the 
next digit. | 

If we don’t borrow out right away, we 
next subtract the minuend digit from the 
subremainder and look at the 1s digit of 
the remainder, which we write down as 
the remainder digit. If the remainder dig- 
it is more than the maximum digit mi- 
nus the minuend digit, we have to bor- 
row out. Otherwise, we don't. 

The procedure NN_Sub, shown in 
Listing Three (page 117), subtracts MPs. 
The subtrahend is b, the minuend is c, 
and the remainder is a. As in NN_Add, 
they all have length digits, and the in- 
dex variable 7 moves through the dig- 
its. The variable borrow is the borrow- 
in and -out. The procedure returns the 
final borrow-out, which is 1 when the 
subtrahend is less than the minuend. 


More Tools 

Before going on to MP multiplication 
and addition, we need two additional 
tools. These add to those built into the 
C language, and let us: 


e Multiply two NN_DIGITs and get a 
two-NN_DIGIT product with a 1s dig- 
it and a Bs digit. 

e Divide a two-NN_DIGIT dividend with 
a 1s digit and a Bs digit by an NN_ 
DIGIT and get the quotient, also an 
NN_DIGIT. 


We need another type for these tools: 
NN_HALF_DIGIT, which has half as 
many bits as NN_DIGIT. (This explains 
the technical constraint that the number 
of bits in an NN_DIGIT must be even.) 
The type depends on the machine; it 
might be an unsigned short, which has 
16 bits on most machines. The mini- 
mum half digit_is 0 and the maximum 
half digit is VB —1. NN_HALF_DIGIT is 
defined in Listing One. 

(These extra tools are in the instruc- 
tion sets of most machines, even when 
NN_DIGIT is 32 bits, but they’re not in 
Standard C. So if you’re working in as- 
sembly language, your work is done. 
Read on to see how to do it in C.) 


Digit Multiplication 

We multiply two NN_DIGITs by com- 
bining four NN _HALF_DIGITXNN_ 
HALF_DIGIT multiplications. We can 
multiply two NN_HALF_DIGITs with the 
tools already built into the C language; 
the product is an NN_DIGIT. (The C 
language’s type-conversion rules require 
that we cast the NN_HALF_DIGITs to 
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(continued from page 42) 

an NN_DIGIT to get an NN_DIGIT 
product. A clever compiler will notice 
the casting and generate an NN_HALF_ 
DIGITXNN_HALF_DIGIT machine-lan- 
guage multiplication.) 

The procedure NN_DigitMult, shown 
in Listing Four (page xx), multiplies 
NN_DIGITs. The multiplicand and mul- 
tiplier are b and c, and the product is a 
(a two-NN_DIGIT array). 

A familiar algebraic formula describes 
the mapping to four multiplications. Let 
bHigh, bLow, cHigh, and cLow denote 
the high-order and low-order halves of 
band c, so that 


b=bHighx\VB+bLow ; 
c=cHigh xVB+cLow 


Then we have: 


bxc=bHighxcHighxB+(bHighxcLow) 
+(bLowxcHigh)VB+bLow x cLow 


NN_DigitMult computes the four mul- 
tiplications and stores the product of 
low-order halves in a/O/, the product of 
high-order halves in a/1/, and the two 
cross products in tand u. 

The low-order halves of the cross 
products need to be added into the 
high-order half of a/O/. Similarly, the 
high-order halves need to be added in- 
to the low-order half of a/7/ 

NN_DigitMult adds the intermediate 
values t and uw. If the sum is less than 
u—there is a carry out—the procedure 
adds 1 to the high-order half of a/1/. 
NN_DigitMult then adds the low-order 
half of t+u to a/O/, carrying into a/1/. It 
finally adds the high-order half of t+u 
to af1/. 


Digit Division 
We divide a two-NN_DIGIT dividend 
by an NN_DIGIT divisor by combining 
two NN_DIGITxNN_HALF_DIGIT divi- 
sions, four NN _HALF_DIGITXNN_ 
HALF_DIGIT multiplications, and some 
other arithmetic. We can divide an NN_ 
DIGIT dividend by an NNHALF_DIGIT 
divisor with the C-language tools; the 
quotient is an NN_HALF_DIGIT, as- 
suming the high half of the dividend is 
less than the divisor. (It’s worth men- 
tioning that we’re interested in the 
“floor” of the quotient—the greatest in- 
teger not more than the quotient.) 
The procedure NN_DigitDiv, shown 
in Listing Five (page 117), divides a two- 
NN_DIGIT dividend by an NN_DIGIT 
divisor. The dividend is b (a two-NN_ 
DIGIT array), the divisor is c, and the 
quotient is a. The two-NN_DIGIT vari- 
able t holds a copy of the dividend. 
NN_DigitDiv assumes that the quotient 
is not more than the maximum digit. 


We guarantee this later when we call 
NN_DigitDiv. 

NN_DigitDiv follows the estimate- 
and-correct method. First, it estimates 
the high-order half of the quotient with 
an NN_DIGITXNN_HALF_DIGIT divi- 
sion. It then subtracts the product of the 
estimate and the divisor from the divi- 
dend. It finally corrects the estimate by 
subtracting the divisor (shifted left a half 
digit) from the dividend, until the divi- 
dend is less than the divisor (shifted left 
a half digit). Next, NN_DigitDiv esti- 
mates the low-order half of the quotient, 
given what remains of the dividend; it 
subtracts the product of the estimate 
and the divisor from the dividend, and 
it finally corrects the estimate. Usually 
there’s only one correction for each half, 
so NN_DigitDiv gets the correct quo- 
tient pretty fast. 

NN_DigitDiv estimates the high-order 
half of the quotient by dividing #/7/ by 
the high-order half of c, plus 1. if the 
high-order half of c were the maximum 
half digit, division would be by VB, 
which is too large, so the estimate is just 
the high-order half of #/7/.) 

NN_DigitDiv subtracts the product of 
the estimate aHigh and the divisor, shift- 
ed left a half digit, from ¢ Then it sub- 
tracts the divisor, shifted left a half dig- 
it, and increments the estimate until 
what remains of f is less than the divi- 
sor shifted left a half digit. 

The assumption that the quotient is 
not more than the maximum half digit 
ensures that the estimate doesn’t exceed 
the maximum half digit. 

NN_DigitDiv then estimates the low- 
order half of the quotient by similar 
means. It divides the low-order half of 
i[1/, shifted left a half digit, plus the high- 
order half of t/O/, by the high-order half 
of c, plus 1. (If the high-order half of c 
is the maximum half digit, the estimate 
is just the low-order half of #/Z/.) Again, 
the estimate aLow is an underestimate, 
and the procedure corrects it. 

It’s a good idea to normalize the divi- 
sor by shifting as far left as possible be- 
fore calling NN_DigitDiv, because oth- 
erwise it can take a long time to correct 
the estimates. If the divisor is normal- 
ized, then the initial estimate can’t be 
off by more than 2. NN_DigitDiv thus 
makes at most two and usually one cor- 
rection. We guarantee that the divisor is 
normalized, too, when we call NN_Dig- 
itDiv later. 


MP Multiplication 

Returning to grade school, remember 
that to multiply two numbers (a multi- 
plier and a multiplicand), you wrote 
them down, one above the other. Then 
you took the 1s digit of the multiplier, 
multiplied the multiplicand by it, and 
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(continued from page 44) 


wrote down the product. You took the 
10s digit of the multiplier, multiplied the 


5432 < multiplicand 

9876< multiplier 

0000¢ accumulator value 
32592 ¢ intermediate product 
32592 <« accumulator value 


38024 ~¢ intermediate product 
412832 
43456 
4758432 
48880 
53646432 < final product 


Figure 2: Grade-school 
multiplication with a twist. 
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multiplicand by it, and wrote down the 
product, below and one digit to the left 
of the first product. And so on; after you 
wrote down all the intermediate prod- 
ucts, you added them up, and the sum 
became the final product. 

Now let’s consider grade-school mul- 
tiplication with a twist. Instead of wait- 
ing until the end to add, suppose we 
accumulate the sum as we go along. 
Then we don’t have to store the inter- 
mediate products, just the accumulator, 
and the accumulator becomes the final 
product. 

Multiplying two MPs thus involves 
two procedures: one that multiplies an 
MP by a digit and adds the product to 
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an accumulator, and one that moves 
through the digits of the multiplier. 
The procedure NN_AddDigitMult, 
shown in Listing Six (page 118), does the 
first part. The digit is c, the multiplicand 
is d, the input accumulator is b, and the 
output is a. The multiplicand and ac- 
cumulators all have length digits. Index 
variable i moves through the digits of 
the multiplicand from right to left, with 
carry as the carry-in and -out. NN_Add- 
DigitMult returns the final carry-out. 


You can implement 
some number- 
theoretic operations 
such as greatest 
common divisor 
with the tools 
presented bere 


NN_AddDigitMult is similar to NN_ 
Add, except for two things. It calls the 
tool NN_DigitMult to multiply digits, 
rather than adding them with a built-in 
NN_DIGIT add (since we’re multiply- 
ing, not adding); and carry is a digit, 
not just 0 or 1. Here there are two con- 
ditionals. The first conditional adds the 
carry. If there’s a carry-out, the first con- 
ditional sets the carry to 1; otherwise, it 
sets the carry to 0. The second con- 
ditional adds the low-order digit of the 
NN_DIGITXNN_DIGIT product. If 
there’s a carry out, the second condi- 
tional increments the carry. NN_Add- 
DigitMult then unconditionally adds the 
high-order digit of the NN_DIGIT~x 
NN_DIGIT product to the carry and con- 
tinues. 

The procedure NN_Mult, shown in 
Listing Seven (page 118), multiplies MPs. 
The multiplier is b, the multiplicand is 
c, and the product is a. The inputs have 
length digits and the output has length 
2~ digits. Index variable i moves through 
the digits of the multiplier from right to 
left. In fact it doesn’t go all the way to 
the left, only as far as the leftmost non- 
zero digit. (The procedure NN_Digits, 
in Listing Ten, page 119, determines 
how far to go.) NN_Mult similarly re- 
duces the length of the multiplicand to 
avoid unnecessary multiplications. 

It's easy to see how NN_Mult works 
with NN_AddDigitMult to accumulate 
the product. Each call has a new mul- 
tiplier digit, b/i/ and operates on a new 
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part of the accumulator, &i/i/, NN_Mult 
stores NN_AddDigitMult’s carry as a new 
high-order digit of the accumulator. This 
is all quite similar to the illustration of 
grade-school multiplication that Figure 
2 describes, in which the first row is the 
multiplicand, the second is the multi- 
plier, and the last is the product. Inter- 
mediate rows give accumulator values 
and intermediate products. Intermedi- 
ate products are between a multiplier 
digit and the multiplicand. They are 
shifted left as many digits as the multi- 
plier digit. The accumulator value is the 
sum of the previous accumulator value 
and an intermediate product. Interme- 
diate products can be added directly in- 
to the accumulator. (NN_ Mult also calls 
two other procedures we haven't dis- 
cussed: NN_Assign, which copies an MP, 
and NN_AssignZero, which sets an MP 
to 0. See Listing Ten.) 


Last Step: MP Division 

We’re now ready to put together all our 
tools to do what’s generally considered 
the most difficult arithmetic operation: 
division. 

We've already seen how to divide a 
two-NN_DIGIT dividend by an NN_ 
DIGIT divisor. The estimate-and-correct 
method has served us well. We extend 
that method to MP division. 

For each digit of the quotient, we first 
estimate the digit by dividing the high- 
order two digits of the dividend (or what 
remains of it) by the high-order digit of 
the divisor, plus 1. Cif the high-order 
digit of the divisor is the maximum dig- 
it, the estimate is just the high-order dig- 
it of the dividend.) We then subtract the 
product of the estimate and the divisor, 
shifted left some number of digits, from 
what remains of the divisor. 

The estimate is an underestimate, so 
we may have to increase it, just as in 


O9876 
9432)053646432 
0000 
53646432 
43456 
10190432 


5432 
4758432 
38024 
956032 
5432 
4126092 
32592 
86912 
9432 
a2592 
2/160 
5432 
9432 
0 








NN_DigitDiv. We do so by subtracting 
the divisor, shifted left some number 
of digits, from the dividend, until what 
remains of the dividend is less than the 
shifted divisor. When we reach the 1s 
digit of the quotient, we have a re- 
mainder that’s less than the divisor, and 
we're done. Figure 3 illustrates grade- 
school long division with the latest 
twist. The first row is the quotient. The 
first part of the second row is the di- 
visor, and the second part is the divi- 
dend. The last row is the remainder. 
Quotient digits are estimated by divid- 
ing the two high-order digits of what 
remains of the dividend by the high- 
order digit of the divisor, plus 1. If what 
remains after subtracting a shifted mul- 
tiple of the divisor is too large, the es- 
timate is corrected. For instance, the 
digits 53 are divided by (5+1) to get 
an estimate 8 for the 1000s digit of the 
quotient. What remains after subtract- 
ing 80005432 is too large, so the es- 
timate is incremented, and another 
1000x5432 is subtracted. 

Dividing MPs, like multiplying MPs, 
involves two new procedures: one that 
multiplies an MP by a digit and subtracts 
the product from an accumulator, and 
one that moves through the digits of the 
quotient. 

The procedure NN_SubDigitMult, 
shown in Listing Eight (page 118), does 
the first part. It is just like NN_AddDigit- 
Mult. NN_SubDigitMult returns the final 
borrow-out. 

The procedure NN_Div, shown in 
Listing Nine (page 118), divides MPs. 
The dividend is c, the divisor is d, the 
quotient is a, and the remainder is b. 
The dividend and quotient have length 
cDigits, while the divisor and remain- 
der have length dDigits. (Notice that 
NN_Div returns both quotient and re- 
mainder, whereas NN_DigitDiv returns 


estimate: 5/6 = 0 


estimate: 53/6 = 8 
(too large) 
corrected estimate: 


estimate: 47/6 = 7 
(too large) 
corrected estimate: 8 


estimate: 41/6 = 6 
(too large) 
corrected estimate: 


estimate: 32/6 = 5 
(too large) 
corrected estimate: 
remainder 


Figure 3: Grade-school division with a twist. 
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only the quotient.) 

The index variable i moves through 
the digits of the quotient from left to 
right. (This is the first time we’ve 
moved in that direction, and we do so 
because division reveals the most sig- 
nificant digit first.) NN_Div estimates 
the quotient digit with NN_DigitDiv, 
subtracts the product of the estimate 
and the divisor with NN_SubDigitMult, 
and corrects the estimate with NN_Cmp 
(MP comparison, see Listing Ten) and 
NN_ Sub. 

NN_Div is a litthe more complicated 
than the other procedures; this is due 
to NN_DigitDiv’s two requirements. For 
efficiency, NN_Div normalizes the di- 
visor. It does this with NN_LShift CMP 
left shift, Listing Ten). The number of 
bits to shift left is determined with 
NN_DigitBits. NN_Div shifts both the 
divisor and the dividend, and at the 
end, shifts back the remainder with 
NN_RShift (MP right shift, Listing Ten). 
The quotient is unaffected by the nor- 
malization. 

For correctness, NN_Div extends the 
dividend with a leading 0 digit. This 
guarantees that the leading digit of the 
dividend is less than the leading digit 
of the divisor, as NN_DigitDiv requires. 
The condition continues throughout the 
main loop. 


BIG-NUMBER ARITHMETIC 


Conclusions 

In this article, we’ve gone from built-in 
C tools to multiple-precision addition, 
subtraction, multiplication, and division. 
One possible next step is modular arith- 
metic, where all results are divided by 
a predetermined quantity called the 
“modulus,” and only the remainder is 
kept. Modular arithmetic, and modular 
exponentiation in particular, are essen- 
tial to the RSA and DSS schemes. You 
can also implement some number-the- 
oretic operations such as greatest com- 
mon divisor with the tools presented 
here. Other next steps include convert- 
ing an MP to base 10—useful if you 
want to print your results—and even 
computing the digits of pi. 

It’s not difficult to implement the arith- 
metic operations more efficiently than 
we've done here, since we've tried to 
keep things simple. Assembly-language 
speedups would help a lot. Some tech- 
nical refinements would help, too, and 
indeed are available in various com- 
mercial and public-domain implemen- 
tations. Better and faster implementa- 
tions are encouraged. 


Editor’s Note 

Public-key cryptosystems such as RSA 
and DSS are subject to U.S. patents 
4,218,582 and 4,405,829, as well as oth- 


er patents issued to Stanford University 
and MIT. Public Key Partners of Sunny- 
vale, California holds exclusive licensing 
rights. Licensed software embodying the 
cryptosystems is available from several 
vendors, including RSA Data Security. 
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Virtual Memory, 64-bit 


Multi-megabyte applications on the PC 


n last month’s installment, I intro- 
duced the concept of a “virtual 
computer” patterned on the Cray 
supercomputer model—an archi- 
tecture defined by software rather 
than hardware. Using the PORT system 
(a PC-based environment that adheres 
to Cray’s design principles, as described 
in my June 1992 article), I showed how 
the virtual computer concept can 
achieve surprising performance on PCs 


and exploit RISC processors, and do so — 


in a seamlessly portable environment. 
This month, Ill discuss salient features 
of the PORT system itself, focusing on 
design decisions rather than imple- 
mentation details. As you recall, PORT 
has a philosophy diametrically different 


from that of UNIX and OS/2. It un- @ 


abashedly plagiarized the best features 
of a number of mainframe systems, es- 
pecially Cray’s CDC 6600. In short, 
PORT?’s architecture is more an incar- 
nation of several proven systems than 
an untested fresh approach. 

The key difference between PORT 
and most other systems is that PORT is 
totally dedicated to running multi-mega- 
byte Fortran applications. (Although 
PORT was originally designed for For- 
tran, its Fortran/C compiler is being mod- 
ified to accept C syntax.) For example, 
the file manager treats all directories as 
libraries (there is no need to MAKE li- 
braries), the linker automatically resolves 


Ian holds a BSc in Mechanical Engineer- 
ing and an MS in Aerospace Engineer- 
ing. He is the principal author of DISS- 
PLA and cofounder of ISSCO. He can be 
reached at Integral Research, 249 S. High- 
way 101, Suite 270, Solana Beach, CA 92075. 
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OBJ-type files from any specified direc- 
tories, and FIND C=WORKSP searches 
OBJ files Ggnoring all other types) for a 
COMMON block called WORKSP. 

It’s no secret that ANSI Fortran is un- 
suitable for systems programming. So 
rather than throw the baby out with the 
bathwater, we vastly extended Fortran 
(while preserving Fortran 77 compatibil- 
ity) instead of creating some new lan- 
guage. PORT’s Fortran/C incorporates 
most of the functions found in C—Bool- 
ean operations, shifts, bit-field opera- 
tives, a pointer type, and low-level I/O 
intrinsics—and has proven an effective 
systems-programming vehicle. The en- 
tire PORT system—including the com- 
piler, linker, editor, virtual memory man- 
agement, and system libraries (about 1 
million lines) —is written in Fortran/C. 





Virtual Memory Without Virtual Memory 
Mainframe-grade Fortran applications 
often expect virtual memory, or the abil- 
ity to run programs larger than available 
RAM by transparently swapping pieces 
to disk. Virtual memory has proved to 
have an even more important benefit: 


_ the ability to call massive programs from 


within one another. For example, the 
ability to call the editor from within your 
program, then the compiler within the 
editor, the file manager within the com- 
piler, and so on. Your own program 
may eventually be totally swapped out 
to disk, but this is transparent. 

Chaining multiple programs, which is 
often confused with multitasking, en- 
ables the development of gigantic ap- 
plications. For example, if you can de- 
pend on calling the editor, there is no 
need to write editor functions into your 
program. 

Being focused on Fortran/C, PORT 
achieves virtual memory simply by bas- 
ing its page size on the longest practi- 
cal subroutine (or function). As long 
as a subroutine never crosses a page, 
all variables and branches within that 
subroutine are local and don’t need vir- 
tual-memory address translation. This 
idea is somewhat similar to the seg- 
ment organization of the DOS huge 
model, except that PORT pages are a 
fixed length and can hold multiple sub- 
routines. As I pointed out in my June 
article, keeping most references local 
is equivalent to about an 85 percent 
“cache hit ratio” right off the bat. PORT 
local addressing is relative to the start 
of the current subroutine instruction 
and data areas rather than a segment 
base address or other hardware-de- 
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(continued from page 50) 

pendent artifice. The absolute RAM off- 
set address to the subroutine is reset 
automatically upon any call or return, 
like the DOS huge model. Unlike the 
huge model, however, if the PORT 
metacode does not find the target page 
in RAM memory, it generates a page 
fault and the page is rolled in from disk 
(behind the programmer’s back). Thus 
the program size is limited by the disk- 
staging area you assign, not the avail- 
able RAM. 

The current PORT page is 4096 64- 
bit words, or 32 Kbytes. (Support for 
64-Kbyte pages will be available soon.) 
The instruction and data areas for each 
subroutine are generally kept in sepa- 


PORT metacode, let's begin with an 
example. Consider the Fortran/C 


IE examining the operation of the 


source statement: Y(J)=X**11. As- _ 


sume that Y is a local array with DI- 
MENSION Y(100). 


_ instruction, see Op Code 06 in Table - 


1. As shown in Figure 3, C is 11, B is 
the scalar X, and A the indirect refer- 
ence to Y(J). The tags differentiate 
operand modes: C has tag 1 (imme- 
diate integer<64K), B tag 2 Cocal 
scalar), and A tag 3 (indirect). Tag 1 
uses the value of the 16-bit field itself 
(11). Tag 2 adds the field (257) to the 
| start address of the subroutine data 
area giving the word address, so X at 
[d/area+ 257] has the 64-bit value 5.7. 


the word points to the operand value, 
hence the value at [d/area+ 316] points 
to Y(J). All 64 bits as a single address 
is excessive, so the array base virtual 
address is in the upper half-word and 
the the lower 32 bits is for the length 
and index. In our example, Y starts at 
virtual page 5, word 862, and J has the 
value at [d/area+541], namely 25. 
The metacode decoder first shifts 
out the 16-bit fields corresponding to 
A, B, and C. In the 386/486 version, 
the decoder then masks out the 9- bit 
combined-tags value in the upper 16 
bits (1*64+2*8+3=83), uses this as 
an index to a table, and branches to 
entry 83. This simple branch decodes 
the tags 1,2,3 in one 386/486 instruc- 
tion. Operands C and B are straight- 
forward (tags 1 and 2). For A, the de- 


coder proceeds as with B, except that _ 


it now masks out the lower 16 bits and 
retrieves the 32-bit pair for J. The sec- 
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rate pages, so instruction pages don’t 
have to be written back to disk. Mea- 
surements On numerous programs since 
1979 have shown that the Fortran/C 
compiler generates 1.2 to 1.4 meta- 
instructions (64 bits each) per executable 
source statement. Hence a PORT page 
can accommodate a subroutine of up 
to about 3000 executable statements, or 
roughly 4500 typical lines. A single 
subroutine longer than that tends to be 
unwieldy and is best broken into small- 
er ones. 

The linker packs as many subroutines 
as possible into an instruction/data page 
pair, which holds around 60 to 200 typ- 
ical-length subroutines (32-Kbyte pages). 
If any one of those routines is called 





ond 16-bit field in the indirect word 
has the length of Y, namely 100; if J 
is less than 1 or greater than 100 an 
array-bounds fault is produced. In the 


- example, the decoder adds J—1=24 to 
_ the base v/address for Y (5:0862), giv- 
ing 5:0886 as the location of YVJ). Page — 


5 is a virtual page, so the decoder ref- 
erences entry 5 of its page-translation 
table for the real-memory page ad- 


_ dress, adds offset 886, and obtains the 


real-memory word address for Y(J). 
Having the values for B and C plus 
the location of A, the decoder shifts 
out the Op Code and performs the op- 


eration listed in Table 1. 


If entry 5 of the page-translate table 


_ is negative, it flags that YQ) is rolled 
: _ out to disk, and the decoder generates 
| Tag 3 is similar to tag 2, except that 


a page fault. If the value for either B 
or C is —0, it produces an uninitialized- 
variable fault. If the exponent in the 
upper 12 bits of B is 0, an unnormal- 
ized floating-point fault results. If A’s 


- exponent will be greater than 1023, a 


floating-point overflow fault is pro- 


_duced. These tests are each a matter 


of one or two instructions and add lit- 


tle to oe se) ties but add a whole 





Figure 3: ¥)=X OP 11 operands. 


and the page pair is not in RAM, the re- 
maining 59-199 ride in with it. The link- 
er exploits this to improve performance 
by grouping routines that call each oth- 
er into the same page pair and also pro- 
vides user-specified grouping. Indeed, 
the linker gives precedence to group- 
ing the branch of a call tree over page 
packing efficiency. Thus, a “miss” on 
one routine brings in many related rou- 
tines on the same two disk references. 
By contrast, most virtual-memory sys- 
tems go through a whole sequence of 
disk references before they develop a 
“working set.” Disk I/O being a pacing 
factor in many virtual-memory applica- 
tions, disk efficiency can outweigh pro- 
cessor performance. 


dimension of debugging power and 
implement virtual memory with sim- 


plicity. 

The upper six bits of the packed in- 
direct word determine whether A is 
YQ), YA,}D, YA+J), or a POINTER, 


and whether J is a variable or an im- 
‘mediate constant; for example: Y(5), 
-YUL5), and YU.+5). In the YU.) and 


Y(+)J) cases, the index field points to 
a second level of indirection, for 
YQ,J,K) a third level, and so on. All 
instances of Y(J) in the subroutine 
point to the same indirect word at off- 
set 316, so there is only one entry for 
Y(J), not a copy for each source line 
at which it occurs (reducing data-area 
redundancy). If the array is longer than 
16 bits (64 Kwords) or starts at any 
value other than 0 or 1 (for example, 
DIMENSION Y(—100:100)), the length 
field points to a word in page 0 con- 
taining the actual limits. 

The metacode may appear daunt- 


ing, but the bottom line is that it con- 


centrates the maximum information in 
as few bits as possible, thereby mini- 
mizing the entropy. Using an 8088 


PC/XT to execute it would be a joke, 


a 32-bit 386 is marginal; RISC proces- 
sors with their one instruction per cy- 
cle and huge banks of registers are 
just what the doctor ordered. Most im- 
portantly, the mass of information pre- 
sented to the RISC processor allows it 
to change the order of the logic, there- 
by plugging free cycles and memory- 
reference delays with useful RISC in- 
structions. (As long as the decoder 
produces the result expected, the in- 
ternal sequence is academic.) 


——1.H. 
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(continued from page 52) 

A key difference between the DOS 
huge model and PORT is that PORT 
permits arrays to cross pages: Arrays can 
be multi-megabyte with an overall lim- 
it of 32 Mbytes per program (310 Mbytes 
with 64-Kbyte pages). This may not be 
anywhere near the 4-gigabyte limit of 
32-bit addressing, but in practice, a sin- 
gle program of even 5-10 Mbytes in- 
volves thousands of subroutines, and I 
have found it to be unwieldy to devel- 
op beyond a certain point. (Bear in 
mind that a megabyte amounts to about 
100,000 lines of Fortran/C, accounting 
for data areas and typical arrays.) The 
4-gigabyte number is a bit academic— 
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simply dedicating a 4-gigabyte disk-stag- 
ing area is beyond most PCs, worksta- 
tions, and even mainframes (per user). 
Just think of the disk-swap time! In prac- 
tice, since PORT programs can call one 
another, even 32 Mbytes per program 
has yet to prove a limitation. What is 
not academic is that 32-bit addressing 
wastes 1—2 bytes 99.99 percent of the 
time, hogging the processor’s critical 
prefetch queue. 

Virtual-memory addressing is only ap- 
plied to access across pages. Since no 
subroutine can straddle pages, virtual- 
memory translation is only necessary for 
array references, call/return, and point- 
ers. (COMMON blocks, strings, and oth- 
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er structures are presented as arrays to 
the metacode so there is no distinction 
at the metacode level.) 


All 64 Bit 

Like mainframes and supercomputers, 
PORT targets a 64-bit bus: Addressing 
is in terms of 64-bit words, and all in- 
structions plus data items are processed 
as multiples of 64-bit words. A byte is 
regarded as just an 8-bit field, so the 
metacode changes a byte value by 
bringing in its 64-bit word, modifying 
the 8-bit field and writing out the whole 


The 4-gigabyte 
number is a bit 
academic— simply 
dedicating a 
4-gigabyte disk- 
staging area 1s 
beyond most PCS 





word. (The cost of a byte and 64-bit 


_word are identical on a 64-bit bus.) “All 


64-bit” execution may be excessive for 
80x86 PCs, but most plug-in RISC cards 
(such as i860-based cards) use a 64-bit 
bus locally. Boards like the Hyperspeed 
D860 allow multiple cards to intercon- 
nect via ribbon cables carrying a blis- 
tering fast 64-bit local bus. 

Although the advantage of 64 bits for 
performance is clear, it has proved 
surprisingly beneficial to Fortran/C cod- 
ing. All Fortran/C variables are 64-bit, 
or a multiple thereof. UNTEGER*2, 
REAL*4, and so on, are accepted but ig- 
nored.) Strings are 64-bit aligned and 
padded to a word boundary. Thus in- 
tegers, floating point, pointers, and ~ 
strings can all be freely intermixed. For 
example, INTEGER, REAL, CHARACTER, | 
and POINTER type variables can be in- 
terchangeably passed through the same — 
subroutine argument and EQUIVA- 
LENCEd to one another, contrary to both 
ANSI Fortran and C rules. Interchange- 
ability simplifies dynamic record struc- 
tures containing variable-length com- 
ponents or omitted items. 64-bit integers 
have proved invaluable for packing and 
manipulating multiple values simulta- 
neously; for example, A=B moves eight 
bytes at a pop. 


The Metacode 
As mentioned in the previous articles, 
PORT defines its own machine-inde- 
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(continued from page 54) 

pendent “instruction set” customized to 
Fortran/C. This metacode is executed 
via a processor-dependent decoder sup- 
plied for each platform. Each meta- 
instruction is a 64-bit word organized as 
shown in Figure 1. All meta-instructions 
have the form A=B op C. The tag for 
each operand A, B, and C determines 
the operand mode—151, J, X(), YC), 
for instance. The text box “PORT’s Meta- 
code” describes metacode operation. 

Table 1 lists all three-operand in- 
structions (A=B op C). Because it’s 
wasteful to use three operand slots for 
two-operand instructions (A=op B), 
these are grouped as instruction 25h us- 
ing the value of C to distinguish them; 
see Table 2. Note how specific the in- 
structions are to Fortran and C. 

Since each meta-instruction needs to 
be interpreted, the decode time is crit- 
ical. The metacode is therefore designed 
to be decoded as fast as possible. For 


Bits / 3923 16 16 


16 
(Op code [cipial c [e [A 


Tags Operands 





Figure 1: PORT meta-instruction 
format. 
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example, even though the PORT page 
is 4096 words, needing only 12 bits, it 
is assigned a 16-bit field because most 
processors have instructions that can 
operate on 16 bits directly. Likewise, the 
3-bit tags describing the contents of A, 
B, C are grouped as a 9-bit field, rather 
than being tacked onto their respective 
A, B, and C—allowing the decoder pro- 
grammer to use a fast 512-way indexed 
branch to interpret all three tags in a 
single assembly instruction. 

Without exception, all meta-instruc- 
tions have an identical format. Further- 
more, the metacode operand format is 
independent of the actual operation. 
(Both the 386/486 and i860 decoders 
interpret the operands before they even 
look at the Op Code.) This rigorous in- 
dependence not only speeds the de- 
coding on existing processors, but has 
an eye on future chips, which will incor- 
porate multiple independent processors 
on the same silicon (Motorola’s recent- 
ly announced M88110 RISC multipro- 
cessor, for instance). A context-free for- 
mat lends itself to parallel decoding the 
A, B, and C operands and even to pre- 
interpreting instructions. Obsolescence 
being reality in computers, we have tried 
to think as far ahead as possible. 


— \llegal op code (catches 0 as an instr.). CALL 


(+):1 Integer ADD. 


(-):i Integer SUBTRACT. 


(*):i Integer MULTIPLY. 
(/):i Integer DIVIDE. 


AND 
OR 
XOR 
ANDF 


(“/):i Integer INCLUSIVE DIVIDE. ORF 





Bx+ICzi 
MOD:i 


(>):i 
(#):i 
(<=): 
(=>) 
arith _L 
arith_R 
Bx#C:r 
(+):r 
Cy 
tay 
(/):r 


POLYNML 


INCADR 
KCHECK 
(=):r 

(<)r 

(>) 

(#):r 
(<e)7 
(=>) 7 
SBYTES 
LBYTE8 
BreakPt 
GO TO* 
jmp#0 
jmp=0 
ARYMOV 
20p 

DO end 


Real-to-integer power. 

Integer remainder. 

Integer test EQUAL. 

Integer test LESS THAN. 

Integer test GREATER THAN. 

Integer test NOT EQUAL. 

Integer test LESS THAN OR EQUAL. 
Integer test GREATER THAN OR EQUAL. 
Arithmetic LEFT shift (preserve sign). 
Arithmetic RIGHT shift. 

Real to power of a real. 

Real ADD. 

Real SUBTRACT. 

Real MULTIPLY. 

Real DIVIDE. 

Polynomial expansion evaluate. 
Increment POINTER. 

Test for operand error. 

Real test EQUAL (r/tests use fuzz factor). 
Real test LESS THAN. 

Real test GREATER THAN. 

Real test NOT EQUAL. 

Real test LESS THAN OR EQUAL. 
Real test GREATER THAN OR EQUAL. 
Replace 8-bit byte (in array or word). 
Extract 8-bit byte. 

Breakpoint. 

List GO TO. 

Branch if NONZERO. 

Branch if 0. 


Block op (array search, init, copy, and so on). 
- Two-operand instructions (see Table 2). 
Loop test, increment/decrement, and branch. 


Table 1: Metacode three-operand instructions. 


56 


XORF 
SBYTE6 
LBYTE6 
|_ FIELD 
S_FIELD 
ATAN2 
SIN_COS 


‘shift L 


shift_R 
StrngOP 
ByteOP 
Unpack8 
Pack8 
CALL_P 
Load_P 
Reset_P 


(+):¢ 
(—):¢ 
(*):¢ 
(/):¢ 
Bx+IC:c 
CMPLX 


(=):c 
(<)iC 
(>):ic 
(#):c 
(<=):C 
(=>).C 





The Peripheral Processor 

As previously mentioned, all I/O func- 
tions are handled by the peripheral pro- 
cessor (PP) program. Although designed 
to execute on a separate processor to 
the metacode computation processor 
(CP), I discussed how the PP coexists 
surprisingly well with the CP in a sin- 
gle-processor environment. Each PP ac- 
tion is specified via a 5x64-bit word 
structure deposited in the CP memory. 
Table 3 lists the current PP action codes. 
The PP is free to honor the requisite ac- 
tion in any way it chooses, thus enabling 
PORT to be implemented seamlessly in 
diverse host environments. 

Again, all I/O data items lie on a 64- 
bit boundary and always have a length 
which is a multiple of 64 bits. (Strings 
are padded, as necessary.) This enables 
the PP implementor to utilize the max- 
imum bandwidth available on the ma- 
chine; for example, the PP lends itself 
readily to 32-bit EISA and Micro Chan- 
nel implementation. By way of com- 
parison, UNIX, OS/2, and DOS I/O are 
specified in terms of a byte offset and 
byte count, which requires a costly shift 
of the data block if the start byte does 
not fall on a 32-bit boundary. 

(continued on page 60) 


Call subroutine. 

Logical AND. 

Logical OR. 

Logical XOR. 

Boolean AND. 

Boolean OR. 

Boolean XOR. 

Replace 6-bit byte (in array or word). 
Extract 6-bit byte. 

Extract arbitrary field from word. 
Replace arbitrary field in word. 

Ratio arc tangent. 

Compute both sine and cosine of C. 
Boolean LEFT SHIFT. 

Boolean RIGHT SHIFT. 

Text op (copy, search, concatenate, and so on). 
Byte-string op (nonspecific StrngOP). 
Unpack string to word-per-byte array. 
Pack eight bits/word array as string. 
Call program on another processor. 
Load program on another proc. 
Restart program on another proc. 

Not used. 

Illegal op code (catches —0 as instr.). 
Complex ADD (c/tests use fuzz factor). 
Complex SUBTRACT. 

Complex MULTIPLY. 

Complex DIVIDE. 

Complex to integer power. 

Real and imaginary to complex. 

Not used. 

Complex test EQUAL. 

Complex test LESS THAN. 

Complex test GREATER THAN. 
Complex test NOT EQUAL. 

Complex test LESS THAN OR EQUAL. 
Complex test GREATER THAN OR EQUAL. 
Not used. 
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(continued from page 50) 

To avoid the PP becoming involved 
in the virtual-memory process, the CP 
guarantees that no I/O block crosses a 
page boundary. As an example, stan- 
dard Seismic SEG-B tapes have records 
of several megabytes apiece. The For- 
tran/C program can readily define a 
multi-megabyte array to hold the record, 
but the PP transfers these records as a 
sequence of one-page blocks. This 
places the onus on PORT and the CP 
to process any virtual-memory page 
faults on the huge array, between the 
record pages, thereby simplifying the 
PP implementation. The CP and PP are 
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designed to execute in parallel. Even 
on a single 386/486 processor, the PP 
caches tape I/O using the DOS timer 
interrupt to service the drive concur- 
rent with the CP program. 

To illustrate the effectiveness of these 
ideas, PORT is able to copy a 2500-foot, 
9-track, 6250-bpi Seismic SEG tape in 
4.5 minutes between two streaming 125- 
ips drives using a Fortran program, on 
a stock 486/33 EISA PC (sans i860). 
Multiplying 2500 feet by 125 inches per 
second gives a theoretical lower limit of 
four minutes. By comparison, an oil 
company had run under UNIX on a 
Sun-compatible rated faster than a 











































_ Hyperbolic tangent. - 


Table 2: Metacode two-operand instructions. 
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01 NOT Logical invert. . 

02 NOTF - Boolean complement. 

03 PP req Flag peripheral aaa and halt. 

04 GO IO _ Unconditional branch. . 

05 (=) ___lransferonewod  # # # 

06 RETURN Subroutine RETURN. 

07 Intrpt _ Suppress interrupts flanore PP flag). 
08 ABS Get absolute value. 

oo INT Truncate real to ieee 

OA FLOAT Integer to real. |. 

OB _ SIGN ___ Assign sign of B to re 

0c STATS ‘Get memory-error statistics. 

0D VADDR __ Set POINTER to address. |. 
OE EXECUTE __ Execute local procedure - : ~~. 
OF MEMCHK  _ Force memory refere test BAM). i ws 
10 EXCHNG __ Swapcontents of AandB. | : 
11 IROUND —__Roundrealtointeger. 

12 — _ Unassigned. 

13 SQRT Square Root. 

14 SIN = sine. 

15 COS _ Cosine. | 

16 TAN _ Tangent. 

/ ArcSIN _ Inverse sine. 

18 — ArcCOS __ Inverse cos. 

19 ~ ArcTAN _ Inverse tan. 

1A LOG Log to base e. 

1B LOG10 Log to base 10. 

1C EXP e to power B. 
10. EXP10 10 to power B. 

1E (+):i _ Direct INTEGER ADD. 

1F (-):i - Direct INTEGER SUBTRACT. 

20 (*):i _ Direct INTEGER MULTIPLY. 

21 Ai Direct INTEGER DIVIDE. 

ee arith_L Direct arithmetic LEFT SHIFT. . 
2 arith_R Direct arithmetic RT SHIFT. 

24 shift_L _ Direct Boolean LEFT SHIFT. 

25 shift_R _ Direct Boolean RT SHIFT. 

26 MOD:i _ Direct remainder. 

27 ORF Direct Boolean OR. 

26 ~ANDF © Direct Boolean AND. 

29 XORF _ Direct Boolean XOR. 

2A (+):r Direct REAL ADD. 

2B (-):r Direct REAL SUBTRACT. 

2C (fy _Direct REAL MULTIPLY. 

20 ~~ ‘Or  __ Direct REAL DIVIDE. | 

2c REAL Real part of complex. 

ar CMPLX Real to complex. 

30 CONJG Complex conjugate. 

ot SINH Hyperbolic sine. _ 

a2 COSH _ Hyperbolic cosine. _ 


SPARC 4 and, after a year of custom de- 
vice driver coding plus C conversion, 
was able to reduce the time from 12 to 
6.5 minutes. The 4.5 minutes reflects a 
sustained data-transfer rate of 26250 
125x(4/4.5)=1.3 Mbytes/sec (account- 
ing for interrecord gaps) from Fortran. 
(At the last SEG show in November 
1991, the PC was hidden under the table 
because the company felt nobody 
would believe a PC could do what it 
was doing.) 


Debug Insecticide 

True to the maxim that the last 5 per- 
cent of the bugs take 95 percent of 
the development time, a bug in the 
DISSPLA shading algorithm took us al- 
most two years to find. It showed up 
on one mainframe, but not another. It 
even disappeared from execution to ex- 
ecution. Not even the most sophisticat- 
ed debuggers were any help, because 
the program never failed when the de- 
buggers were active. New DISSPLA re- 
leases were held up because every time 
we thought we had it, the bug popped 
up anew. (Sometimes we thought it de- 
pended on the phases of the moon.) It 
turned out to be a truncation from float- 
ing point to integer used to compute 
the index to a workspace array, whose 
contents depended on memory at the 
time the program was run. (I=INT(A) 
is highly processor dependent when A 
falls between 0.999... and 1.000...01.) 
Experiences like these on standard sys- 
tems almost drove me into a different 
line of work, and I resolved that when 
we designed what is now PORT, de- 
bugging features would take priority 
over any performance considerations. 
The first time it was run under the SU- 
PERSET precursor to PORT, the bug 
popped up—and was nailed. 

The design of PORT assumes that 
nothing is working properly. For ex- 
ample, the file-management system at- 
taches an internal software checksum 
to every record of every file. Whenev- 
er a file is copied on the disk or trans- 
ferred to or from tape, this checksum is 
validated by the PORT software. We’ve 
even caught problems in the error- 
checking circuitry of the hardware! 

The PORT metacode performs nu- 
merous checks on every instruction 
without exceptions, not even for the 
PORT system itself. These include: 


e Array-bounds checking on every array 
reference, including COMMON blocks, 
strings, and subroutine arguments. 

e Uninitialized-variable checks on all 
arithmetic and compare instructions 
(but not Boolean). The linker initial- 
izes memory to —O (0 with sign bit 
set), which the metacode never gen- 
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tempt orary 
please wait. 


C: Creating 
swap | file, 





ow much longer can you afford to wait? 


Create Overlaid Programs-Fast. 


BLINKER™, the world's first and 
fastest dynamic overlay linker, 
reduces your link time to seconds 
and reduces program memory 
requirements. Now you can use 
one linker for all your software 
projects. 


One Linker, Many Languages. 


BLINKER 2.0 links and automati- 
cally overlays DOS programs 
written in Microsoft®? C, BASIC, 
Assembler, QuickBASIC” 
Fortran, Pascal, Watcom” C, 
Zortech C++, Clipper, FORCE® 
and in Borland® C, C++, 
Assembler, and more. 


Save Time and Memory. 


BLINKER removes the need for 
overlay structures, simplifies 
program design and reduces 
memory requirements to save 
you time, effort and memory. 


Memory Swap Function. 
BLINKER is the ONLY linker to 


offer an integrated memory swap 


function, So you can run other 
large programs from within 
your program, with negligible 
memory overhead. 




















High Per forfniance Hiern Overlay Linker 


Don't Settle for Less. 


Other major features include full 
CodeView® support, use of 
EMS/XMS at program run-time, 
and enhanced execution speed 
of overlaid code. 


Time is Money. 

BLINKER offers all this in a frac- 
tion of the time it takes to link 
with your current overlay linker. 
You know time is money, and link 
time is no exception. 


Free Demo 

To try our free demo 
on your own code 
and for more information 
contact your local distributor: 


Call: 800-323-1809 | 
FAX: 602-443-0659 


PROGRAMMER 5 


8283 North Hayden, Suite 195, 
Scottsdale, AZ. 85258 





BLINKER is available 
In 5.25" or 3.5" diskette 


format. Please specify 
when ordering. Blinkinc 


VISA : cs 





PO. Box 7154 
Richmond VA 
rcs {a | 
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(continued from page 60) 
erates on any arithmetic operation. 

e Invalid pointers. Pointers are 64-bit 
and carry the length of the original 
item together with a mandatory check 
pattern. 

e Unnormalized floating point. Usually 
an integer is passed through a real ar- 
gument. 

e Field overflow (such as attempting to 
store a value greater than 255 in an 
8-bit byte). 

e Field, or shift count, ws se end of 


Output text line — 
Inputtextline 
Set flag chars _ 
Set date/time < : 
Get error stats — 
Jumper ports 

Get date/time — 


Discard interrupts. 
Jump history — 
Mag tape op. 


Generate tone 


Set spooling — 
Set COM protocol 
Swap disk page — 
High-speed I/O 
Terminate PORT 

- Switching program: 
Host file ae | - 
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a 64-bit word. 

e Invalid text strings—strings not ter- 
minated by a null (0 byte). 

e Invalid loop limits (DO J=1,N with 
N<=0, for example). 


These bugs tend to occur during ex- 
ecution, and no compiler “lint” can pick 
them up. The DISSPLA bug was de- 
tected by the uninitialized-variable 
check. In most cases, an uninitialized 
fault is just a nuisance, but every so of- 


ten it see across something really 








Figure 2: Program execution in a multiprocessor environment. 
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serious and pays for itself. The checks 
are conducted by the metacode decoder 
program (in the CP) as part of each 
meta-instruction’s execution. As I point- 
ed out in June, a RISC-based decoder 
can often bury the checks in free cycles 
so that they add little to the overhead. 

The metacode approach provides oth- 
er bonuses for code development. For 
example, the decoder can be requested 
to store the last 2048 branches (provid- 
ing a software “logic analyzer” execu- 
tion history). The decoder stores the 
count of meta-instructions executed by 
subroutine, furnishing a “profiler” that 
has proved invaluable for optimizing 
multi-megabyte applications. The PORT 
debugger can request the decoder to test 
for an event on every meta-instruction. 
Since the tests are embedded in the de- 
coder, the debugger slows the program 
by just a few percent. Unlike debuggers 
that insert code, there are no loop-holes 
and no need to recompile. Basically, the 
metacode decoder represents modifiable 
“microcode” that can be programmed to 
do whatever you like. 

Another subtle use of the modifiable 
microcode is the application of a “fuzz 
factor” to all floating-point compares 
(for example, IF(B=1.0) THEN succeeds 
if B=0.999...). The fuzz factor is user 
modifiable and circumvents the night- 
mare DISSPLA bug I described. 

An important aspect of virtual mem- 
ory is that the PORT debugger is avail- 
able at all times on all programs, espe- 
cially on release versions at user sites. 
The linker attaches a copy of its tables 
to the end of each executable image (no 
exceptions). These include the location, 
programmer name, date/time, and 
source installation of all the OBJ files. 

Upon any fault (ike an array bounds 
or bad pointer), the debugger swings 
into action, swaps the program back to 
disk, and loads the linker attached ta- 
bles. Using this roadmap, the debugger 
can probe any program—even itself. 
The key difference between the PORT 
debugger and Codeview is that PORT 
can debug the program at the user site 
with the user’s proprietary data in the 
exact state of its failure. (PORT doesn’t 
know “memory dumps.”) In multi-mega- 
byte apps developed by a team, the abil- 
ity to pinpoint the subroutine version 
by date/time and the programmer re- 
sponsible has proved worth its weight 
in gold. 


Multiprocessor 

The philosophy of PORT is to throw as 
many processors as possible at the task, 
not as many tasks as possible at the pro- 
cessor. Dedicating the entire system to 
the current application simplifies multi- 
processor handling, thereby improving 
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throughput rather than wasting cycles 
in overhead. The elegant simplicity of 
Cray’s memory-mapped CP/PP com- 
munication scheme proved easily ex- 
tensible to multiple processors. 

All processors are under the control 
of the CP, which in turn is under the 
control of your program. The program 
treats each processor as if it were exe- 
cuting a subroutine. To see how this 
works, consider using an auxiliary pro- 
cessor to do three-dimensional render- 
ing; see Figure 2. The rendering is done 
by a call to RENDER_3D (OBJ_LIST, 
FRAME_BUF), for example, where 
OBJ_LIST is the list of solid-object co- 
ordinates, and FRAME_BUF is the tar- 
get frame buffer. In a single-processor 
environment, RENDER_3D would just 
be a normal Fortran/C subroutine, and 
indeed, the linker expects a Fortran/C 
version. Prior to calling RENDER_3D, 
the program requests the CP to load 
RENDER_3D on processor 1 via a 
LOAD_PROCESSOR system call. If there 
is no processor 1, or no assembly ver- 
sion of RENDER_3D for it, the CP re- 
turns an error status to the user pro- 
gram. The program can then opt to 
terminate or proceed using the Fortran/C 
version. Given that everything is kosher, 
the metacode decoder on the CP inter- 
cepts the RENDER_3D call. Instead of 
executing the normal Fortran/C call, the 
decoder makes sure that the OBJ_LIST 
and FRAME_BUF arrays are loaded in 
RAM, passes their addresses to the REN- 
DER_3D version in processor 1, and 
fires up processor 1. The decoder then 
executes a return to the user program 
as if it had executed the Fortran/C REN- 
DER_3SD call. 

The user program executing in the 
CP and RENDER_3D in processor 1 pro- 
ceed in parallel. When RENDER_3D is 
done, it sets a bit in a word tested by 
the CP prior to executing each meta- 
instruction (an interrupt without a hard- 
ware interrupt). A user-supplied inter- 
rupt-processing routine is then called. 
Alternatively, the program can just opt 
to poll a common status word it defines, 
to avoid interrupt handling. Upon de- 
tecting completion by processor 1, the 
program can send FRAME_BUF to the 
VGA and make another call to REN- 
DER_3D or other routine loaded in pro- 
cessor 1. The bottom line is that your 
program is in control of all processors 
at all times. 

Note that RENDER_3D has two ver- 
sions: a high-speed, machine-dependent 
assembly version and a slower Fortran/C 
version. Thus the seamless portability 
of the program is preserved, albeit with 
the slower version. This feature is also 
useful for the orderly development of 
multiprocessor applications. If you go 
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to the hassle of multiprocessor, speed 
is usually your criterion; PORT there- 
fore focuses on running tight assembly 
modules, rather than a bureaucratic, 
symmetric, multithreaded C system. (Sil- 
icon Graphics workstations atest to the 
speed of a tightly coupled MIPS 3000 
RISC processor rendering 3-D.) 


Conclusion 
The underlying PORT theme is to min- 
imize entropy—to concentrate the infor- 
mation and focus on the objective. Bug 
checks increase entropy, but a program 
with a serious bug has infinite entropy. 
In these three articles on “Personal 
Supercomputing,” I’ve endeavored to 
introduce an alternative to the UNIX and 
OS/2 approach. My intent is not to dis- 
parage their multitasking philosophy, 
but to show that it is not gospel and 
that there is room for diverse system de- 
signs. PORT shows that by targeting a 
specific class of applications, it can over- 
come many of the limitations of the 
UNIX approach, for its constituency. 
Conversely, PORT is not the ideal ve- 
hicle for agenda managers, word pro- 
cessors, and multiple TSRs. The PORT 
design realizes this and endeavors to 
complement the host system rather than 
replace it. Nevertheless, technology has 
moved far beyond the PDP 11 and PC/ 
XT. DOS, UNIX, and OS/2 must come 
to terms with high-speed, multiple RISC 
processors and 64-bit operation to re- 
alize the full potential of the next wave 
of technology. 


DDJ 


Vote for your favorite feature/article. 
Circle Reader Service No. 4. 


C Language 


#15 


Q: [had the definition char x/6] in one 
source file, and in another I declared 
extern char *x. Why didn’t it work? 


The declaration extern char *x simply 
does not match the actual definition. 
Use extern char x |. 


#22 


My compiler complained when | 
passed a two-dimensional array to a 
routine expecting a pointer to a 
pointer. 


The rule by which arrays decay into 
pointers is not applied recursively. An 
array of arrays (that is, a two- 
dimensional array in C) decays into a 
pointer to an array, not a pointer to a 
pointer. 


Copyright © 1992 Steve Summit 
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Inside the 


Windows Scheduler 


Making a black box transparent 


Matt Pietrek 


he Microsoft Windows API is @4 


full of “black boxes.” The SDK | 
documentation tells program- | 
mers how to call the various | 
Windows functions, but rarely 3 
explains why you must do something. — 
All too often, we're told, “You must do | 
this for your program to work,” or “You | 
can’t do that in this particular situation.” | 
But seldom is there any description of | 


what is going on beneath the surface. 


A principal “black box” in Windows | 
is the scheduler. In this article, ] exam- | 
ine how the Windows scheduler is im- | 
plemented, and how it interacts with | 
the rest of the Windows. In explaining | 
the scheduler, I refer to data structures | 
internal to Windows. It will help your | 
understanding of the scheduler if you | 
are at least somewhat familiar with struc- | 
tures such as the task database, the | 
module table, and the message queue. | 
Many of these structures have never ' 


been publicly documented, but fortu- 
nately some information is now starting 
to appear in print (see “References”. 


The Two Schedulers 

Please note that, in this article, I discuss 
the Windows scheduler as implement- 
ed in the KERNEL module. (As you may 
recall, Windows is principally composed 
of three DLLs: KERNEL, USER, and 
GDI.) The Windows scheduler is the en- 
tity that controls execution of individu- 
al Windows apps. This scheduler should 


Matt works for a California program- 
ming-tools vendor, specializing in debug- 
gers and file-format programming. He 
can be reached at CIS 76117,1720. 
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that exists at a lower level within the 
Windows environment. That other 
scheduler is the time-slicing virtual ma- 
chine (VM) scheduler implemented by 
WIN386.EXE. The WIN386 scheduler 
only exists in Enhanced mode, and is 
used to preemptively switch from one 
VM to another. A VM is a hardware-sup- 
ported task that can contain a DOS ses- 
sion or the entire set of Windows apps. 
The Windows scheduler runs within the 
system VM; the system VM is the first 
VM to be created, and contains all the 
Windows apps. The Windows sched- 
uler is therefore only active when the 
WIN386 scheduler has given a timeslice 


| to the system VM. 








not be confused with another scheduler 


Because there is no time-slicing at the 


level of KERNEL, the Windows sched- 
_uler is conceptually rather simple (al- 
| though complicated in its implementa- 
| tion). Unlike the OS/2 scheduler which 
_ manages a hierarchy of thread classes 
| (further subdivided into priority levels 
| within each class), the Windows sched- 
| uler is fairly “flat.” Because there is no 
| preemption, a Windows app will con- 
| tinue to run until it gives up the CPU to 
| another program. If a program enters a 
| tight loop, or otherwise “hogs” the CPU, 
| the system will effectively be deadlocked 
until the application yields the CPU. The 
act of yielding is either done explicitly, 
| or by calling a Windows-API function 
| that causes a yield to occur. 


| Scheduler Overview 

. The basic idea of how a Windows app 
| gets the CPU is relatively simple, and 
’ can be summarized as: “The first per- 


son in line who has a dime gets to use 
the phone next.” The “line” in this case 
is the linked list of Windows tasks. The 
“dime” is a nonzero value in the “event- 
count” field (offset 6) of the task 
database. Of course, the actual imple- 
mentation of the scheduler is more com- 
plex than that, but it’s a good first-or- 
der approximation. 

So how do tasks get these events? The 
event-count field is incremented by calls 
to PostEvent(), an undocumented func- 
tion called by routines such as PostMes- 
sage) and SendMessage(). Additional- 
ly, it is called when there are events in .. 
the “system queue” destined for the task, 
and when invalid regions need to be 
repainted. The system queue is a “hold- 
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THE REST ARE DISCOVERING SQA. 


wie and procedures, monitor test results, 
assign and track problem resolution, and 
incorporate your own quality metrics. 


—_ SQA:MAnacer's sophisticated analysis 
and reporting not only allows you to to 
view a snapshot of your current test 
activities, but also to project future costs 
and product reliability. 




































If you are still testing software with 
manual methods, home-grown tools and 
dozens of engineers working long hours 
just before release, here's how to ces the 
monkey off your back. 


ADVANCED TEST EXECUTION 
FOR THE DESKTOP | 


Start with SQA:Rosor, the automated 
test tool for GUI development that is so fast, 
so affordable, and so easy to use that it may 
soon be on every engineer's desk. 


SQA:RosoT is designed to automate the 
repetitive, labor intensive tasks involved in 
testing Microsoft Windows™ and OS/2™ 
Program Manager applications. SQA:ROBOT 
lets you create your test scripts by recording 
every keystroke, mouse movement and 
window event. With a simple point and 


STEP UP TO TEAM TESTING 


Both SQA:Ropot and SQA:MANAGER are 
available in either standalone or networked 
versions. So they're sophisticated enough to 
handle the largest corporate project, and 
-modular enough to benefit the individual 
user. Software Quality Automation's 
concept of a central test repository ensures 
that every member of your development 
and testing organization can share the same 
click, you can replay one or an entire series information. The result? What we call 'team 
of test procedures for unattended test testing’, with productivity benefits that will 
automation. - amaze you. 

: join companies like Unisys, AEtna, 
Borland, Easel, IBM, and Lotus who have 
made the decision to save time and money 
with SQA's integrated test workbench. 


Or you can just monkey around. 
SQA:RoBOT starts at $495.00 
SQA:MANAGER Starts at $995.00 


SQA products are available for 
MS- Windows, OS/2 and UNIX. 


SPREAD THE WEALTH 





You can share the benefits of automated 
test execution with every member of your 
development team using SQA:MANAGER, 
the award-winning test management tool. 
Working in tandem with SQA:RosotT and — 
other third-party test tools, SQA:MANAGER 
becomes the hub of your integrated test 
workbench. With it you can document test | 
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(continued from page 64) 

ing area” for input from hardware de- 
vices such as the mouse and keyboard. 
In addition, each task has its own mes- 
sage-queue segment for handling mes- 
sages posted to it. The handle for this 
segment is stored at offset 20H in the 
task database. It is important to note 
that the event-count field in the task 
database does not correlate directly to 
a message sitting in the task’s message- 
queue segment, waiting to be retrieved. 
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Update task profiles 
Tf ( TDB signature not OK) 


goto startup_this_task 
Walk_through_task_list: 


Try next_task: 


ShrinkHeap () 

DiscardFreeBlocks() 

IsUserIdle() 

if ( fPokeAtSegments != Q ) 
if ( USER is idle } 


INT 28h 


INT 2Fh, AX = 1689h 


Does_this_task_have_an_event?: 


point to next TDB 
goto Try_next_task 


goto Reschedule_done 
startup_this_task: 


goto Reschedule_done 
TF ( InDOS Elag D 
goto Try_next_task 


Increment InScheduler flag 


Lock the global heap 
Save 55:S5P in current IDB 


if ( 80x87 present ) 


if ( 8@x87 present ) 


Decrement InScheduler flag 


Unlock the global heap 
Reschedule_done: 


Return to caller 


Figure 1: Pseudocode for RescheduleO. 
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Save registers on stack of outgoing task 


goto Walk_through_task_list 
Get DirectedYield field in TDB into AX: 
If ( DirectedYield hTask == 
goto Walk _through_task_list 
If ( event count in DirectedYield TDB != @ ) 


//--------------- TOP HALF OF TASK WALK------------- 
Point to the first task in the Linked list of tasks 


if ( not pointing past the end of the task list ) 
goto Does_this_task_have_an_event? 
[--------------- THE [OLE LOOP-------------------—- 
// In KRNL386 only 
// In KRNL386 only 


Call routine to load a boot time module segment 
goto Walk_through_task_list 


goto Walk_through_task_list 

//------------- BOTTOM HALF OF TASK WALK---------------------- 
if ( event count field == @ ) 

ii TASK SWitCuiG CODE---------------- -------- 

// Bound a potential task to switch to 


// Make sure it's OK to switch 
if ( found taek == cubrent task ) 


if ( there is a locked task, and it's not the eurrent task } 


// It's OK to switch tasks now 


Delete & re-insert task into IDB list to give it proper cegriey, 


Call function that checks the following 


save control word in outgoing TDB 
if ( Current disk not eet in TDB ) 
Save the current DOS disk/directory in the TDB 
Call routine that sends task switch out notification 
Set new Windows current TDB value 
Set new Windows PDB value from the PDB value in the incoming TDB 


load control word from incoming TDB 
Call routine that sends task switch in notification 
Switch to S&S:SP from incoming TDB 


Tell the display driver that we've switched // In KRNL386 only 


Restore registers from stack 


SCHEDULER 


Instead, the event count is an indicator 
to the scheduler that the task needs to 
be woken up, because something is 
waiting to be processed. 


How do You Get into the Scheduler? 


Most applications are written without a 
thought as to how the application will 
give up control of the CPU, and how it 
can be a good citizen in the Windows 
environment. The reason is that the ba- 
sic structure of most Windows applica- 


gero it otit in TDB 


) 


tions is based upon the GetMessage(/ 
DispatchMessage() loop. Inside GetMes- 
sage), if no messages are waiting to be 
processed, and if no “hardware” or Paint 
messages are waiting to be synthesized, 
then the task is put to “sleep.” When 
the task wakes up, there is input for it, 
waiting to be processed. 

The act of sleeping consists of loop- 
ing around, waiting for a particular set 
of wakeup flags to be set in the mes- 
sage queue of the task. The waiting is 
accomplished by calling the semi- 
documented WaitEvent() function, 
which calls the core scheduling routine. 
We'll discuss this core routine in ex- 
haustive detail later. If the event count 
in the task database is nonzero upon 
entry to WaitEventO, then WaitEventV 
doesn’t call the core scheduling rou- 
tine, but instead returns without yield- 
ing. Each time it’s called, WaitEventO 
decrements the event-count field in the 
task database by 1. 

Besides GetMessage(), several other 
functions result in the scheduler getting 
invoked: 


e SendMessage(). If the message to be 
sent is destined for a task other than the 
current one, SendMessage() calls Di- 
rected Yield(V), which forces the destina- 
tion task to be woken up and become 
the current task. By calling PostEventO, 
SendMessage() ensures that the desti- 
nation task will be scheduled by the call 
to Directed Yield, 

e PeekMessage(). Much of the code in 
PeekMessage() repeats that in GetMes- 
sageV). The PM_NOYIELD flag can be 
used with PeekMessage() to prevent a 
task switch from occurring. PeekMes- 
sage() is commonly used to allow oth- 
er tasks to execute while your program 
is performing a lengthy chunk of work. 
The perfect example of this is doing a 
compile within one of the Windows- 
based integrated-development environ- 
ments, such as Turbo Pascal for Win- 
dows, or Microsoft QuickC for Windows. 

e The dialog-box functions. These in- 
clude all the variations of DialogBox(, 
as well as MessageBox(). These routines 
all use a PeekMessage()/ DispatchMes- 
sage() loop inside the USER module. 
When you call one of these functions, 
it is entirely possible that your task may 
be switched away from for a while. 

e The Yield family of functions. 
Yield— and DirectedYield( are “layers” 
on top of the undocumented O/dYieldO 
function. OldYieldO calls the core sched- 
uling routine. The key difference be- 
tween DirectedYieldO and Yield is that 
DirectedYieldO sets a field in the cur- 
rent task database that specifies which 
hTask the scheduler should attempt to 
transfer control to. Besides SendMes- 
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sage(), DirectedYieldO is also used by 
the WINDEBUG/CVWIN/TDWIN DLLs 
to switch between the debugger and 
the debuggee. It is also used by the 
Toolhelp TaskSwitchQ) and Terminate- 
AppO functions. 


Inside the Core Scheduling Routine 
At this point, we'll dive into details of 
the actual Windows scheduler. The func- 
tion is called RescheduleQ() and is not 
exported by the KERNEL module. In this 
article, I will be discussing the imple- 
mentation of Reschedule() as it appears 
in Windows 3.1 KRNL386.EXE. Except 
as otherwise noted, KRNL286.EXE has 
the same code. Figure 1 provides pseu- 
docode for the scheduler. It is reason- 
ably close to what actually occurs in 
keschedule(). | suggest you examine it 
with parmesan cheese close at hand, 
because it’s a spaghetti coder’s dream! 
keschedule( takes no parameters, and 
returns no values. Any routine that calls 
Reschedule() has no idea of what hap- 
pens between the time Reschedule() is 
CALLed and when the next instruction 
begins executing. RescheduleC) may de- 
cide that there is no reason to switch 
tasks, and simply return. Alternatively, 
it may switch away from a task for hours 
on end. To the caller, Reschedule() is 


an atomic operation. Time effectively 
does not exist for the task when it is in 
keschedule(). 

To provide this transparency of op- 
eration, as well as keep the system flow- 
ing smoothly, Reschedule() has to ac- 
complish four things: 


e Find the next task that should be 
scheduled, and make sure no “ex- 
traordinary” conditions prevent it from 
being switched to. 

e Save the complete state of the “out- 
going” task, and restore the state of 
the “incoming” task. Not doing so re- 
sults in chaos. 

e If no task needs to be scheduled, go 
into an “idle loop” that allows back- 
ground actions to take place. 

e Update the values that Windows main- 
tains for the current task. 


Unfortunately, the code to do each of 
these things is not broken up into dis- 
tinct sections. Instead, the work for each 
of the above items is completely inter- 
meshed, and lies in various states of 
completion throughout much of Re- 
schedule). In the following section, I'll 
cover each of the steps taken by 
keschedule(), while also trying to keep 
the big picture in mind. 


Entering the Scheduler 
Upon entry to Reschedule(), the fol- 
lowing registers are saved on the stack 
of the calling task: BP, DS, SI, DI, AX, 
CX, ES, BX, DX. Furthermore, the act 
of calling Reschedule() causes the CS 
and IP registers to be placed on the 
stack. The ToolHelp routines TaskGet- 
CSIPO and TaskSetCSIPQ rely upon the 
stack frame created by Reschedule() to 
obtain and set the CS:IP values to be 
used when the task starts up again. The 
32-bit registers are not saved across task 
switches. If your code uses the 32-bit 
registers, then it’s important that you be 
aware Of when a task switch might oc- 
cur and plan accordingly. 

After saving the registers on the stack, 
a routine is called that updates the pro- 
file CIND files if they have been changed 
since the task was last switched to. Fol- 
lowing that, the current hTask value 
maintained by Windows is loaded into 
DS, and the task-database signature 
found at offset OFAh is verified to be cor- 
rect. If the task database (TDB) looks 
incorrect, then Reschedule( immediate- 
ly jumps to the code that begins look- 
ing for a new task to switch to. If the 
signature bytes in the TDB look okay, 
then the WORD at offset OAAH is ze- 
roed out, but not before retrieving its 
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value into AX. This value is nonzero if 
Directed YieldO was called, and indicates 
which hTask the caller wants scheduled 
next. However, RescheduleQ) will not 
blindly schedule this new hTask; it first 
verifies that the event count in the po- 
tential new TDB is not 0. If it is, then 
Reschedule() treats this invocation as if 
it were invoked by a regular YieldO call. 
All this implies that the task specified in 
a DirectedYieldQ call may not be the 
one that’s scheduled next. As stated in 
the Windows 3.1 documentation, if you 
really want to make sure a particular task 
is scheduled, then use a PostAppMes- 
sage() to ensure that the event-count 
field in the TDB is nonzero. 

The next major section of code fol- 
lowing this “startup” sequence is the 
heart of RescheduleQ), and is responsi- 
ble for finding a suitable task to switch 
to. As mentioned earlier, all the task 
databases in the system are kept in a 
linked list. Reschedule() starts at the 
head of the chain, and iterates through 
each TDB, looking for one with a 
nonzero event count in the WORD at 
offset 6. Note that this field is not the 
same as the message-count value stored 
in the message queue of each task. For 
the remainder of this article, I'll refer to 
this walking of the task list as the “walk- 
through-task-list” code. When a TDB 
with a nonzero event count is found, 
Reschedule() attempts to switch the cur- 
rent task to that TDB. 


The Idle Loop 

If no events are waiting to be processed 
and all the tasks have been iterated 
through, then the walk-through-task-list 
code falls into the idle-loop section. The 
actions of this section of code depend 
on whether you’re running in Standard 
or Enhanced mode windows, and on 
whether you’re using virtual memory. 
The code described in the next two 
paragraphs exists only in the KRNL386 
.EXE version. 

Upon entering the idle loop, the code 
checks the values in some of the KER- 
NEL paging flags, and if conditions are 
right, calls an internal routine called 
ShrinkHeapO which walks the global 
heap. If it finds enough free memory, 
it unlinks the free blocks from the glob- 
al heap. 

If the paging system is in use, and if 
certain other paging flags are set, then 
the idle loop proceeds to call another 
internal routine: DiscardFreeBlocks(). 
The job of DiscardFreeBlocks() is to find 
free blocks of paged memory and give 
them back to the DPMI server. During 
the call to DiscardFreeBlocks(), hard- 
ware events may have occurred. To deal 
with this, RescheduleC) JMPs back to the 
walk-through-task-list code and starts 


anew the process of looking for a suit- 
able task to switch to. 

Having done the previous heap 
housekeeping, Reschedule() makes a 
call to the USER routine JsUserIdleQ). In 
Windows 3.1, this is where USER places 
its checks to see if a screen saver should 
be activated. The return value from Js- 


The Windows 
scheduler should not 
be confused with the 

virtual-machine 
scheduler that runs 
in Enhanced mode 


UserIdle() is a BOOL, and is True if a 
mouse button is held down, False oth- 
erwise. 

If the return from JsUserldle() is True, 
and if a flag called fPokeAtSegments is 
nonzero, then once every 20h times 
through the idle loop, an internal rou- 
tine called PokeAtSegments() is called. 
This routine walks the module list, and 
loads any discardable segments that 
haven’t been previously loaded. The 
module-list walk stops when it encoun- 
ters the SHELL module. This presumably 
causes PokeAtSegments() to load seg- 
ments only for the modules required for 
Windows to boot up. Once all the seg- 
ments have been loaded, the /PokeAt- 
Segments flag is set to 0, so that the idle 
loop doesn’t try to call PokeAtSegments() 
any more. If PokeAtSegments loaded a 
segment during a particular iteration of 
the idle loop, then the code JMPs back 
to the walk-through-task-list code. 

Two things remain to be done in the 
idle loop. The return value from Js- 
Userldle() is saved away on the stack 
before calling INT 28, the MS-DOS idle 
interrupt. When the INT 28h returns, the 
IsUserldleQ( return value is retrieved from 
the stack and used to decide what will 
be in the BX register for an INT 2Fh, 
AX=1689h call. If IsUserIdleC returned 
True, then BX=0, otherwise BX=1. The 
INT 2Fh, 1689H call is documented in 
the INT2FAPIL.INC file in the DDK, as 
the “Windows kernel idle call.” When 
the INT2Fh returns, the idle loop JMPs 
to the walk-through-task-list code. 


We've Found a Task...Now What? 

Upon finding a TDB that has a nonze- 
ro event count, Reschedule( starts the 
process of saving away the “context” of 
the outgoing task and restoring the con- 
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BEFORE PROTECTING 
YOUR SOFTWARE... 


...against piracy and unauthorized use, 
make sure that your protection system has 
all the following qualities: 


A GOOD HARDWARE KEY 


Hardware-based software protection 
systems are now the standard worldwide. 
However, not all keys are the same. A good 
key should have all the 
following features: 


iii 


Y Compatibility and transpa- 
rency. The key should work 
without any problem on your. 
customers’ computers. The 
user should be able to forget 
the key after connecting it. 





V Unbreakable electronics. A 
customized ASIC (Application 
Specific Integrated Circuit) 
component should be 
integrated in the key. This 
4 prevents reverse engineering 
and makes cracking virtually 
impossible. 






| WA unique and inaccessible 
software developer’s code burnt into the 
ASIC. (This code should not be held in the 
key’s memory, where it can be read and 
altered.) 


V A Read/Write Memory inside the key 
should be available on demand. The 
memory should be writable in the field, on 
any PC, without any special programming 
equipment. 


Y Very low power consumption, enabling 
the key to work even under the worst 
power conditions, on PCs and laptops, with 
or without a printer. , 


POWERFUL 
SOFTWARE 


Since it’s practi- 
cally impossible to 
crack or duplicate 
a key having all 
the features 
mentioned above, a 
pirate will usually go for the 
software linking the protected 
program to the key. Therefore, 
check that your protection soft- 
ware has all of the following: 


V A Linkable Protection Module 
with which calls can be made to 
the key from any point in the 
protected program. 
















Y An ‘Envelope’ installation program. 
Such programs enhance security while 

making it possible to protect a software 
even without its source code. 


Y Sophisticated antidebugging and 
encryption mechanisms. 
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SOFTWARE PROTECTION 
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MORE: 


HASP was designed by a team of computer 
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protecting 4 
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programs 
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V A Pattern 
Code Security 
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DOS, SCO Xenix & Unix-386, OS/2, 
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protecting DOS and WINDOWS software 
under network environments, including 
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IN A TEST CARRIED OUT BY 
AN INDEPENDENT LAB, 


HASP WAS THE ONLY KEY 
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(continued from page 68) 
text of the incoming task. Before this 
can be done, though, a few things need 
to be checked. If the incoming task is 
the current task, then there’s no reason 
to go through the full process of saving 
and restoring the task context. Instead, 
Reschedule( restores the registers saved 
on the stack upon entry and returns. 
The next important thing to check is 
whether there are any “locked” tasks in 
the system. A locked task is the only 
task in the system allowed to receive 
messages. All other tasks are shut out 
until the task is unlocked. A task can be 
locked by the LockCurrentTaskQ) in 
KERNEL, or by the LockMyTaskO func- 
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tion in USER. A system modal message 
box is an example of a locked task in 
action. RescheduleC checks whether any 
tasks in the system are locked. If the in- 
coming task is not the same as the 
locked task, then the saved registers on 
the stack are restored, and RescheduleQ 
returns without having switched tasks. 

The last thing checked before start- 
ing the process of switching tasks is a 
check to see if KERNEL is currently in 
its INT 21 handler. This is accomplished 
by checking the KERNEL_INDOS flag, 
which is also checked in other “critical 
sections” of code elsewhere in KERNEL. 
If KERNEL is currently in its INT 21 han- 
dler, the walk of the task list is resumed 
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at the next task in the list of tasks, rather 
than at the start of the task list. It’s not 
readily apparent why the walk doesn’t 
start again at the head of the task list. 
Once we’ve gotten past the above 
gauntlet of idle loops and other checks, 
Reschedule( now starts the actual pro- 
cess of saving away the context of the 
current task and waking up the incom- 
ing task in the same state in which it 


Unlike the OS/2 
scheduler, the 
Windows scheduler 


is fairly “flat” 


was left. The switching code is as much 
of a “critical section” as any other piece 
of code, so it is only fitting that a glob- 
al variable InScheduler is incremented 
to indicate that work is in progress. The 
first thing that the switching section does 
is readjust the priority of the task. The 
task priority is given by the WORD val- 
ue at offset 8 in the TDB. Tasks with a 
lower priority number will get an op- 
portunity to run before tasks with high- 
er priority numbers. The priority of a 
typical task is 0, but it can range be- 
tween —32 and 15. The priority of a task 
can be set by the undocumented Set- 
PriorityO function in KERNEL. Up un- 
til now, I haven’t discussed how this 
priority system is implemented, at least 
not explicitly. Earlier, I mentioned that 
the list of tasks is walked, looking for 
the first task that has at least one event 
waiting for it. As it turns out, the task 
list is always kept in priority-sorted or- 
der. The tasks with the lower-priority 
numbers come first in the list, thereby 
causing their TDB to be checked for 
events before tasks with higher-priority 
numbers. Reschedule( causes the task 
list to be kept in order by momentarily 
deleting the incoming TDB from the task 
list and then reinserting it. The function 
that inserts tasks into the list inserts the 
TDB in priority order. 

Because most tasks share the same 
priority value, it is important to give 
them each a fair chance to be selected 
for scheduling. This is done by incre- 
menting the priority value in the in- 
coming TDB before removing/reinsert- 
ing it into the task list; after reinsertion, 
the priority is decremented back to its 
previous value. This has the net effect 
of placing the incoming task later in the 
list than any other tasks that share the 
same priority value. (A lower numeri- 
cal value means a higher priority.) 

Upon reprioritizing the TDB list, 
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Reschedule() next sets a flag in the seg- 
ment that manages the information for 
the global heap. By setting this flag, the 
global heap LRU functions are pre- 
vented from changing the global heap 
during the remainder of the task-switch- 
ing code. 

At this point, it is time to finish sav- 
ing the context of the outgoing task. As 
you will recall, most of the 16-bit reg- 
isters for the outgoing task were pushed 
onto the stack upon entering Resched- 
ule(). The remainder of the job involves 
three steps. First, the current SS:SP val- 
ues are placed into the DWORD at off- 
set 2 in the outgoing TDB. Then, an in- 
ternal routine called SaveStateO is called. 
Inside SaveStateC), the FSTCW instruc- 
tion is used to save the control word of 
the 80x87 into the outgoing TDB. Of 
course, this is only done if an 80x87 is 
installed. I use the word “installed” to 
mean that the math-coprocessor bit was 
set when KERNEL did an INT 11H at 
boot time. Additionally, if the high bit 
of the current drive field in the outgo- 
ing TDB is not set, then INT 21h, 
AX=19h, and INT 21h, AX=47h are 
used to store the current drive and di- 
rectory values into the outgoing TDB. 
The final act of the outgoing process is 
to call a routine that causes the “out- 


going-task” notification to be sent. This 
notification can be caught by TOOL- 
HELP.DLL as an NFY_TASKOUT notifi- 
cation, or by a RegisterPtraceL/ Toolhelp- 
Hook() callback function with AX=0Dh. 

RescheduleO now turns to waking up 
the incoming task. As you might expect, 
the steps to start it up are almost a mir- 
ror image of the steps to save away the 
outgoing task. First, the global variable 
containing the current TDB is set to the 
value of the incoming TDB. Next, the 
PDB (or PSP if you prefer) of the in- 
coming task is retrieved from the in- 
coming TDB, and stored in the global 
variable that Windows uses for its cur- 
rent PDB value. If an 80x87 is installed, 
the control word for the incoming task 
is loaded from the TDB via an FLDCW 
instruction. Continuing on, the corre- 
sponding “incoming notification” is sent 
(NFY_TASKIN for Toolhelp, A=0Eh for 
the RegisterPtrace()/ ToolhelpHook(Q call- 
backs). 

The SS:SP registers are switched to 
the values stored away in the incoming 
TDB. The InScheduler flag is decre- 
mented. If we’re running KRNL386.EXE, 
the UserkepaintDisable( function in the 
Display module is called with a 0 pa- 
rameter, indicating that hardware up- 
dates are now okay. The flag that pre- 


vents mucking with the global heap is 
decremented. The final act of Resched- 
ule() is to pop the register values for 
this task off the “new” stack. 


Summary 

As you might have guessed, the Win- 
dows scheduler is integrally tied to many 
parts of Windows. In our journey, we 
have glanced over many areas, includ- 
ing tasks, modules, the global heap, no- 
tifications, and interrupts. All are wor- 
thy of study in their own right. If you 
really wish to understand how Windows 
works, you must be familiar with all of 
them, as well as many other things. It’s 
a steep learning curve, but in the end, 
I believe it’s worth it. Hopefully this ar- 
ticle has given you some idea of what 
really goes on beneath the hood and 
inspired you to do more exploration on 
your Own. 


References 

Schulman, Andrew, David Maxey, and 
Matt Pietrek. Undocumented Windows. 
Reading, MA: Addison-Wesley, 1992. 


DDJ 


Vote for your favorite feature/article. 
Circle Reader Service No. 5. 


MEMORY LIMEMS HMA 0OS5.0 XMS BUFFERS UMB~ BLUEMAX QUALITAS : 386MAX 486SX DRIVERS 386SX NETWORKS TSRs_ C&T 286 










WV¥ 

















. maximum memory without using risky “magic 3 
5 | 0Se p tricks.” The only one that works with total > 
= reliability under every major memory spec, includ- 
2 ing DPMI, VCPI and EMS. x 
* ne AUT r y fil a ia Q AT Which are all good reasons to make the same = 
N choice in memory management yourself — S 
= 386MAX°® 6 for PC compatible 386 and 486 systems, ® 
; nd or BlueMAX™ 6 for all 386 and 486 IBM PS/2s. 5 
| Microsoitchose. Sis isws7-s3ee 
“ * or call toll-free. -HO0-b/ B38 2 
= icrosoft® includes just one memory \ Ure \ 

= manager with their revolutionary new 

a C/C++ 7.0. 

2 It’s the memory manager that earned 

m PC Magazine Editors’ Choice for 1992, 

ei the BYTE Magazine Award of Merit, and 

ys 0Uté~<C«S unqualified praise from . : 

: a InfoWorld and PC Week. | | 7 
ws * ‘The one that intel- bs = 
= 7 * : ™ RZ 
| Sede vesseoes'te The Intelligent Memory Managers: 
o : DOS5.0 XMS BUFFERS UMB 386MAX 486SX DRIVERS 386SX NETWORKS TSRs BLUEMAX C&T286 UMB 386MAX  386SX 


© 1992 Qualitas. Qualitas, 7101 Wisconsin Avenue, Suite 1386, Bethesda, MD 20814. All company and product names are trademarks or registered 
trademarks of their respective owners. System Requirements: Any 386, 386SX, 486 or 486SX PC or PS/2, min. 256K of extended memory, DOS 
3.0 or higher, and hard disk drive. (386MAX also supports 286 machines with Shadow RAM, or any PC with EMS 4.0 hardware with 256K 
memory. Feature availability and memory recovery may vary on these systems.) 


CIRCLE NO. 669 ON READER SERVICE CARD 














EMBEDDED SYSTEM 


Mine fron _ 


Assembly to C 





A technical-support veteran discusses using C for embedded systems 


ost embedded systems pro- 


grammers learned their craft _ 
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programming. Why? Because - 
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were the only tools around for writing — 
code for target systems. Now, howev- | 


er, high-level language tools specifical- 


ly designed for embedded systems de- 


velopment are available. Consequently, 


C is increasingly becoming the language ' 
of choice for many embedded systems _ 


projects. 
What’s C’s attraction? For starters, C is 


more maintainable and portable—and 
just as efficient—as assembly. Still, many 


assembly programmers shy away from 


moving to C, citing its steep learning 


curve or the lack of language facilities. 


This article has its roots in my two — 
years on the front lines of Intermetric’s © 
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although much of the information ap- 
plies to other 8-, 16-, and 32-bit pro- 
cessors as well. 
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computer and compile for use with any 
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wee §=your application. If you plan on using 





not all of these processors offer the same 
support for the C language, you need 
to first understand how C handles the 
stack, recursion and reentrancy, the 
heap, C keywords of special interest to 
embedded systems programmers, the 
runtime startup, automatic data initial- 
ization, locating, function prototypes, 
and library support. You then need to 
decide how important these are to your 
application. 

For example, most C compilers use 
a stack. As in the assembly world, the 
stack is used for temporary storage. In 
C, the stack is used to pass parameters 
and as storage for local variables (called 
“autovariables” in C). Because of the 
dependency on a stack, you must con- 
sider your processor when planning 


| on-chip RAM only (256 bytes on the 
_ M68HC11A8), you'll need to minimize 
_ your use of local variables, parameter 
_ passing, and nested subroutine calls. 
However, if you'll be using external 
_ RAM, you can allocate a section of 
_ memory there for stack usage. 


Alternatively, many C compilers can 


_ generate code that uses static memory 
instead of a stack. This option minimizes 
_ the RAM requirements of an application, 
_ but generally makes functions compiled 
_ this way non-recursive and non-reen- 
_ trant. If your application does not re- 
| quire recursion or reentrancy, you can 
take advantage of compile-time models 
_ that use static memory for parameter 
_ passing and local variables. Otherwise, 
you'll have to consider stack usage when 
~ deciding how much RAM your applica- 


tion will need. 

A program is recursive if the com- 
piled program can call itself during run 
time and execute correctly. The facto- 
rial program in Example 1 shows a re- 
cursive routine. If the local variable 
temp is stored on the stack, the pro- 
gram will execute correctly. Each time 
the program calls itself, storage for 
temp will be created in a new area of 
the stack. However, if temp is stored 
in a static area, say memory location 
0x2012, each successive call to the pro- 
gram may overwrite the previous val- 
ue of temp (depending on the evalu- 
ation order of the compiler). In this 
case, the factorial of any number 
would be 1. 

A program is reentrant if it can be 
entered by more than one task simul- 
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Figure 1: Stack and heap in memory. 


int factorial (int fact) 


int temp; 
temp=fact; 
at (ifsco). 
_temp-1; 
elee ~~ 
‘enpt-factorial (temp-1) 
_fetucn Lee 
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(continued from page 72) 

taneously without loss of information. 
The factorial program in Example 1 is 
reentrant if each task that calls factorialO 
has its own individual stack area. Reen- 
trancy is important in the embedded 
systems world if you plan on calling 
functions from both your main appli- 
cation code and your interrupt service 
routines, or if you will be using an op- 
erating system and running concurrent 
tasks. 

The heap is an area of RAM used by 
the C runtime library routines mallocO 
and calloc() to request memory dy- 
namically. This is useful when you do 
not know the size of an array or struc- 
ture at compile time and would like to 
avoid preallocating a large area of RAM 
that may not be used. The heap is an 


/* face = 0) «/ 


/* fact > @ Ff 





Example 1: Factorial program that illustrates recursion. 


area of memory, similar to the stack, that 
must be set up before your program be- 
gins executing. Typically the heap starts 
a certain number of bytes away from the 
stack pointer; during program execution, 
the stack and heap grow toward each 
other, as shown in Figure 1. In this sce- 
nario, it’s common for the stack and 
heap to collide, which will generally 
cause unexpected results! If you plan on 
using memory-allocation routines, you'll 
need to carefully consider the stack and 
heap when planning the RAM require- 
ments of your application. 

One of the benefits of using C is that 
you will now have access to the many 
data types in the language. These in- 
clude integer types: char, short int, int, 
and long int; and floating-point types: 
float and double. Combining these da- 
ta types with C’s array, structure, and 
pointer facilities lets you define com- 
plex data structures and manipulate 
them easily. 

For the most part, you may define and 
access your program variables without 
concern for how they are stored, as the 
compiler will treat each type consistent- 
ly. If you want to manipulate your data 
from outside the domain of the C com- 
piler, however, you'll need to know how 
the compiler represents the different da- 
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ta types. For example, is an int a 16- or 
32-bit quantity? (Some compilers allow 
you to specify this at compile time.) Are 
floating-point numbers stored in IEEE 
Standard form or in a form used only 
by the compiler vendor? You'll need to 
know the answer to these questions if, 
for example, you plan on processing da- 
ta from an A/D converter which may 
use a format incompatible with your pro- 
gram. If this is the case, you'll need to 
write an assembly-language routine that 





Example 2: (a) An error message will result if your 
program tries to modify a variable that has been declared 
const; (b) using extern and const to tell the compiler that 
pi bas been defined outside this module. 


ASM 10 ( 


will translate the data into the format 
used by the compiler. 

There are four keywords in C that are 
of special importance to the embedded 
system programmer—const, extern, 
volatile, and register. These keywords 
modify your variable declarations and 
notify the compiler that the particular 
variable has a special property. 

The const keyword declares that a 
variable cannot be modified during pro- 
gram execution. As the name implies, 
this is most often used in embedded 


systems programming to declare that 
the variable is a constant that will not 
be changed and can therefore be stored 
in ROM. Declaring strings as const may 
be helpful if you are trying to minimize 
RAM usage. A compiler will generate 
an error message if your program tries 
to modify a variable that has been de- 
clared const, see Example 2(a). The ex- 
tern keyword identifies variables defined 
in another module. In Example 2(a) I’ve 
defined pi in one module. If pi is ref- 
erenced in another module, tell the com- 





Example 3: The result of declaring porta as volatile. 
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(continued from page 706) 

piler that pi has been defined outside 
this module and that it is a const type. 
Otherwise the compiler is free to as- 
sume that pi is a regular float variable 
that can be modified. This is done by 
specifying both extern and const in the 
second module, as in Example 2(b). 

As a general rule, external declara- 
tions should be the same as the origi- 
nal definition, except that you need to 
include the keyword extern and omit 
any initialization. 

The volatile keyword is used to de- 
clare that references to a variable should 
not be optimized away. This is impor- 
tant in embedded systems when a vari- 
able may be memory-mapped I/O or 
storage shared between independent 
tasks. Example 3 shows the result of 
declaring porta as volatile. Here, the 
assembly code generated for C line 7 
has been optimized out. Because porta 





was declared volatile, the code for the 
second assignment to porta was not 
deleted. 

The register keyword is used to spec- 
ify that a variable should be stored in a 
register, as it will be used extensively in 
the program. Placing a heavily used vari- 
able in a register can reduce code size 
and increase execution speed. Howev- 
er, many compilers will ignore the reg- 
ister keyword if the target processor has 
a limited number of registers. 

Even if you write your entire appli- 
cation in C, one minimal piece of as- 
sembly code will generally be required. 
This is the runtime startup program, and 
its purpose is to set up the environment 
for your C program. At a minimum, the 
startup will load the stack pointer. Your 
runtime startup may also include code 
to manipulate target registers such as 
the M68HC11 EEPROM block-protect 
register, BPROT, that must be zeroed 


Example 4: Most C compilers make a distinction between initialized and 


uninitialized global variables. 


within the first 64 cycles after reset to 
enable EEPROM writes. 

Data manipulation may also occur in 
the startup program. Many startup pro- 
grams will zero out any uninitialized 
variables you have defined, since the C 
language specifies that these variables 
have the value 0 when the program be- 
gins executing. The startup routine may 
also be the place to set up initialized 
global variables. This is called automatic 
data initialization. 

Most C compilers make a distinction 
between initialized and uninitialized 
global variables; see Example 4. In the 
embedded systems world, this is be- 
cause variables that will be modified 
must reside in RAM, yet RAM contents 
are usually erased or corrupted when 
power is removed. For uninitialized 
global variables, this is not a problem, 
as you do not expect these variables to 
have any meaningful value at startup 
time (other than 0). However, this is not 
true of initialized variables. These must 
have the value that you specified in your 
program. 

One solution is to store the values of 
these variables in ROM and then ini- 
tialize the RAM locations during the run- 
time startup process. In order to mini- 
mize the ROM storage requirements, at 
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compile time initialized global variables 
are typically generated into one data 
section, and uninitialized global vari- 
ables are generated into a different da- 
ta section (often called the “bss” sec- 
tion—a carryover term from the original 
C compiler development on the PDP- 
11). Only variables in the initialized da- 
ta section need to have their values 
copied into ROM. Figure 2 illustrates a 
typical M68HC11 memory map before 
and after the runtime startup has copied 
data from ROM to RAM. 

One key feature of ANSI C compilers 
is their support for function prototypes. 
Function prototypes are function dec- 
larations that provide an explicit list of 
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a function’s parameters and their types. 
The benefit of providing a function pro- 
totype is that it enables the compiler to 
check that the arguments passed to a 
function match the prototype, prevent- 
ing parameter mismatch problems be- 
tween calling and called functions; see 
Example 5. 

Function prototype checking means 
that the compiler can make its best ef- 
fort to comply with your prototype. If 
it can widen a parameter to match the 
prototype, it will do that. However, the 
compiler cannot truncate a parameter 
without loss of information, so it will 
flag those calls as errors. 

Most C compilers offer an extensive 
C runtime library that offers a variety of 
routines that you can call from your pro- 
gram. These include I/O functions like 
printf{O and scanfO, character functions 
like isalphaQ, string functions like str- 
cpyQ), and math functions like sgrtO. 
Most compilers load only those library 
routines referenced by your program, 
so you don’t have to worry about en- 
tire libraries being copied into your ap- 
plication. Some compilers also include 
the source to the C library so that you 
can modify routines if necessary. 

If you will be using the I/O functions, 
you should check to see whether you 
will need to modify any routines. You 
may need to modify getchar() and 


putchar( for your target hardware; the 


other I/O routines are usually coded to 
call putchar() and getchar() and don't 
need to be changed. Example 6 shows 
C versions of putchar and getchar for 
the M68HC11. 

Once your program has been com- 
piled, you will need to link and locate 
it. In assembly programs, you may have 
used an assembler directive like ORG 
to locate your code and data sections 
in one program file. In C, compiled code 
is usually relocatable—this means that 
the location of instructions and variables 
is not known at compile-time. Locating 
code and data is the job of the linker. 

Compilers make this job easy by al- 
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Example 6: C versions of putchar and getchar for the M68HC11. 
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(continued from page 80) 

lowing you to locate code either by 
module or by section. This capability 
means that you can change the loca- 
tion of your program simply by relink- 
ing—your code does not need to be 
recompiled. However, this does mean 
that some code will need to be in a 
separate module. For example, in the 
M68HC11, the interrupt vector table is 
located at OxFFD6. Generating the ad- 


dresses for this table usually means 
compiling or assembling a separate 
module that includes only the address- 
es for the interrupt service routines. 


Combining C and Assembly 

Often, the size or speed of a particular 
routine is critical, and you want to use 
assembly language because it offers 
maximum control. Most C compilers 
support a variety of features that let you 





can program an ‘“‘external environment” 


to interact with your code to simulate your 


target system. The emulator is the hard- 
ware extension of the simulator! 


In-Cireutt Emulation 





The 30MHz real-time emulator has been _ 


the industry standard for years. With its 
complex breakpoint logic and advanced 


trace, nobody can beat it for performance. 
Plug-in or RS-232 configuration. All 8051 


derivatives are supported! 


noHaU 


CORPORATION 


51 E. Campbell Avenue, Campbell, CA 95008 
FAX (408) 378-7869 


(408) 866-1820 e 


“the DEMO. You can load u up to 64K of 
code and use 64K of XDATA space. You. 









Belp File papriiter i 


#64 void moving _demo() { 
P0042 £4 CLR A 


8043 19 MOU Can, A 
be for( pi = arrayX, c = sizeof( array 
004 








iA 62 MOU .pi, #02 
1B 66 «MOU 618,800 
1c 06 6MOU «61C, 808 
1893F WU .c,t3F 
18 MOV A.C 
MOU sOR?,A 
) 


g 
GO [FROM addrenas 


Australia (02) 654 1873, Austria (0222) 38 76 38, Benelux +31 1858-16133, Canada (514) 689-5889, 
Czechoslovakia 0202-2683, Denmark (42) 65 81 11, Finland 90-452 1255, France (01)-69 41 28 01, Germany 
08131-25083, Great Britain 0962-73 31 40, Greece 01-862-9901, Hungary (1) 117 6576, Israel (03) 48 48 
32, Italy (011) 771 00 10, Korea (02) 784 784 1, New Zealand (09) 392-464, Portugal 01-80 9518, Norway 
02-649050, Singapore (065) 284-6077, Spain (93) 217 2340, Sweden 040-9224 25, Switzerland (01) 740 41 05, 
Taiwan (02) 7640215, Thailand (02) 281-9596, Yugoslavia 061 621066. 


CIRCLE NO. 554 ON READER SERVICE CARD 


82 


/ * iis declared as an int */ 





Example 7: (a) Using this C 
declaration, an MOSHC11 C compiler 
will generate the code in (b); (b) 
generated code; (c) a subroutine call 
to func. 


combine the best of C and assembly. In 
addition to becoming familiar with these 
features, you'll need to understand the 
compiler’s interface conventions so that 
you can call assembly programs from C 
and vice versa. 


In the assembly 
world, the stack is 
used for temporary 
storage 


There are three main interface con- 
ventions to be aware of when combin- 
ing both C and assembly-language rou- 
tines: naming global symbols, passing 
parameters, and returning values. (The 
following examples may or may not be 
valid for the compiler you use.) 

Many C compilers automatically pre- 
pend an underscore character to all 
global symbols defined in C. This is 
done to avoid unintentional conflict with 
symbols defined at the assembly level. 
In the C declaration in Example 7(a), a 
M68HC11 C compiler will generate the 
code in 7(b). If your assembly routine 
needs to reference a variable defined in 
C, you will need to generate an exter- 
nal instruction in your assembly code 
that includes the underscore. 

Similarly, a subroutine call to func) 
will look like Example 7(c). Therefore, 
if func( is written in assembly language, 
you will need to publicly declare it as 
_func within your file. 

If you write your entire application in 
C, you won't have to worry about pa- 
rameter passing since the generated 
code will observe the conventions de- 
fined by your compiler. If you plan on 
writing some of your routines in as- 
sembly language, however, you will 
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need to learn what these conventions 
are and write your code accordingly. 
Typically, C compilers will pass param- 
eters by pushing them on the stack in 
reverse order (last parameter pushed 
first). This is how your assembly pro- 
gram will need to receive them. 

If there is only one parameter being 
passed, it may be passed in a register 
rather than on the stack. Also, many 
ANSI compilers will only pass parameters 
as a multiple of words. This means that 
variables defined as type char may be 
widened to type int; compilers may also 
widen other types, like float to double. 

One way you can discover how the 


compiler passes parameters is to look 
at the assembly code generated. Since 
these conventions vary widely from on 
vendor to another, writing a dummy C 
program like that in Listing One (page 
120) and compiling it to assembly code 
can save you some time in writing your 
own assembly programs. Note that if 
optimization is turned on, the generat- 
ed code may not be what you expect. 
If possible, compile without optimiza- 
tion to avoid code from being altered. 

Compilers also expect functions that 
return values to do so according to con- 
vention. In Listing One, subroutine re- 
turned an int passed in the D register. 
Other variable types are returned either 
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main() 


{ z 


_asm("sei\n"); /* disable interrupts */ 
fune(): /* call function*/ 
_asm("cli\n"); /* enable interrupts */ 


} # 





Example 8: Using inline assembly to 
disable interrupts before a call to a 
subroutine. 


in registers or on the stack, depending 
on the size of the return value and the 
registers available for the specific pro- 
cessor. 

Compilers often reserve processor 
registers for their own use. This is typi- 
cally done to point to global data areas 
or for use as an external stack pointer. 
You will need to verify which registers 
are reserved by your compiler and 
avoid using them in your assembly pro- 
grams. If you must use a reserved reg- 
ister, it is your responsibility to save the 
value of this register and then restore 
it before your program returns to the 
calling program. 

Many C compilers offer inline as- 
sembly support. This feature allows you 
to insert a line or more of assembly code 
within your C program. For example, 
suppose you wanted to disable inter- 
rupts before a call to a subroutine. Typ- 
ically this would look like Example 8. 

You can write your interrupt vector 
table in C or assembly language. Gen- 
erally, you will write a module that de- 
fines data locations that will contain the 
addresses of your interrupt service rou- 
tines. At link time, the linker will resolve 
these references and fill in the correct 
values. Listing Two (page 120) is a C 
version of a MO8HC11 interrupt vector 
table. This module can be compiled and 
then located at OxFFD6 to serve as a 
generic interrupt vector table and mod- 
ified later as you write your interrupt 
service routines. 

Finally, because many processors sup- 
port memory-mapped I/O, it is useful 
to be able to define variables in C whose 
location corresponds to a particular I/O 
register or port. This can be done in C 
by defining a pointer that references an 
absolute address. Listing Three (page 
120) shows how this is done. The ab- 
solute address specified in the #define 
statement will be substituted for the 
symbol value. 
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(Listings begin on page 120.) 
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Header prediction and forward-error correction for 


etworking hardware tech- 

nologies of the future promise 

data-transfer rates of gigabits 

per second in place of today’s 

megabits—yet many of the 
techniques used in our software and 
hardware architectures don’t scale well 
to these high data rates. The basic prob- 
lems we face are an order-of-magnitude 
increase in processor speed, coupled 
with a three order-of-magnitude increase 
in network bandwidth. So even if our 
network-protocol processing cost is re- 
duced to 1 percent (with today’s net- 
works), and the processor speed im- 
proves as expected, we'll still reach the 
saturation level, with no room left to 
run an application! 

Once we bring I/O and application 
demands into the picture, things be- 
come quite problematic. I/O is at the 
bottom of the computer system’s “food 
chain” for processor cycles. Applica- 
tions demand massive I/O and want to 
supervise the demand for the proces- 
sor as well. The obvious loser in this 
competition is the operating system, 
which is being squeezed from both the 
top (by applications) and bottom (by 
device hardware). 

In this article, I'll discuss problems in- 
volved with high-speed networking, 
focusing on certain software improve- 
ments in the Berkeley TCP/IP imple- 
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mentation that attempt to deal with some 
of the challenges. You can see this soft- 
ware in operation in any 386BSD system. 
Variations of this approach have already 
been used with FDDI (100 megabits/sec) 
and HiPPi (880 megabits/sec). 


Network Profusion Magnifies the Problem 
Ironically, the very success of comput- 
er networking has magnified the scale 
of these problems. While the number 
of host computers on networks is grow- 
ing at a more-or-less “linear” rate, the 
geographic span of computer networks 
is growing much faster. This problem 
isn’t restricted only to workstations; even 
in the PC arena, networks are a formid- 
able consideration. (Novell estimates 
that 45 percent of all PCs sold today will 





be connected to a network.) 

The transmission rates of new long- 
distance communications technologies 
cause the original ARPANET 56-kilo- 
bits/second network and the current 
T1 (1.544 megabit/sec) service com- 
mon on the Internet today to pale 
in comparison. Among these tech- 
nologies are T3 (44.736 megabits/sec- 
ond) and the Synchronous Optical Net- 
work (SONET), which is ordered in 
units of 51.84 megabits/second (OC-1), 
up to 2.488 gigabits/second (OC-48). 
(T1 has considerable historical signifi- 
cance, however, since it was created 
as the original digital “trunk” service 
between telephone exchanges—hence 
its name, T1.) 

All these new services result in more 
bits on the wire per unit of time. By in- 
creasing the data rate by three orders 
of magnitude, we now have literally 
tens of megabytes of data in flight be- 
tween any two urban centers. Since we 
can’t do anything to reduce the delay 
on a long-distance connection (unless 
someone learns to travel faster than the 
speed of light), we have to look to oth- 
er solutions. 

For example, let’s send a message 
from San Francisco to New York over 
different high-speed networks. Assum- 
ing a 70-millisecond “flight” time, we 
will find 12 Kbytes in flight for a T1 con- 
nection, 385 Kbytes for a T3, and almost 
20 Mbytes for an OC-48 connection! If 
we discover an error in this message 
upon reception, we are forced to re- 
ceive a barrage of useless data before 
the sender is informed of the problem 
and can respond. Worse yet, the sender 
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would need to remember the previous 
“in flight” data to be able to respond. 
Clearly, high-speed networks require 
new approaches to leverage their ad- 
vantages appropriately; otherwise, the 
problems begin to resemble those of 
the sorcerer’s apprentice. 


LANs Suffer Too, but Differently 

Not only are long distance networks get- 
ting faster, so are local area networks 
(LANs). FDDI (made more cost effec- 
tive by a twisted-pair 100 Base 2 ver- 
sion) holds the promise of 100 megabit/ 
second technology for workstations and 
PCs. (Indeed, PC LANs may exceed the 
PC disk-drive transfer rates in use!) Many 
workstation vendors are exploring ATM 
(Asynchronous Transfer Mode) and 
SONET to provide scalable LAN band- 
width. One method is to downsize the 
telephone-company model of commu- 
nications by using a simplified switch 
at the center of a star network radiating 
to workstations. The switch allows each 
“ray” of the star to be independently up- 
graded to faster bandwidths as they be- 
come available. 
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But the transition of LANs to very 
high-speed networks is not strongly af- 
fected by volume of traffic or delay; in- 
stead, high speed networks suffer from 
a different malady. A notable charac- 
teristic of very-high bandwidth LAN 
network transfers is for them to be 
“bursty” —usually, one transfer will com- 
mand all the resources of the comput- 
er’s processor and network I/O device 
for a brief period of time, to the exclu- 
sion of others. These “loudmouth” trans- 
fers (such as image downloads from a 
server or database group-commit oper- 
ations) command the attention of client, 
server, and any gateways and/or net- 
work in between. 

Another characteristic of LAN tech- 
nologies is the ever-present cost factor. 
While 100+ MIPS on the desktop is with- 
in sight, 15-20 MIP PCs and worksta- 
tions are far more realistic. If we could 
streamline network-protocol processing 
overhead, we might be able to leverage 
0.1-gigabit/sec LAN technologies avail- 
able on current workstations/PCs, and 
use the same trick for the 1-gigabit/100 
MIP LAN/computer combinations as 
they become available. 


1 LaserMaster utilizes a host- 
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CPU to process print jobs 
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to rasterize the page. 
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Knowing this, network and operat- 
ing-system designers can implement sin- 
gle-element caches at key points in the 
networking software implementation. 
This will “short-circuit” the lookup and 
evaluation of data structures in proces- 
sing a protocol. This approach favors 
the most common case, in which the 
packet being processed is tied to the 
program associated with the previous 
packet processed. This will clear out 
much of the overhead in our increas- 
ingly complicated network protocols. 

None of these problems are irrecon- 
cilable. In fact, there are many com- 
peting solutions we should be aware of 
as the world gradually migrates into gi- 
gabit networks. Both simple and com- 
plex solutions can be found by “think- 
ing gigabit.” 


Example: Single-element Caches in 
386BSD TCP 

A simple example of thinking gigabit 
can be found within 386BSD’s Trans- 
mission Control Protocol (TCP) imple- 
mentation, the current version of the 
University of California’s Internet net- 
working implementation. This protocol 


Attention Speed Freaks! 


emember your old 8086 PC? 


language compatibility. We even 
make an 800-dpi version of the 
WinJet, so you can more easily 
read six-point code. 


So stop relying on slow, outdated 
printer technology. Call 
LaserMaster today for your 
prescription for fast Windows 
printing! 


Call 800-365-4646 or | 
612-944-9330 today. 




















peed advantage in 
three important 
eines Ss ways. 






[L M )’ LAseRMASTER™ 


6900 SHADY OAK ROAD, EDEN PRAIRIE, MN 55344 


make your Windows™ printing 
better. Like full PostScript™ 


Bet oat | 


©1992 LaserMaster raat HP and LaserJet are registered trademarks of Hewlett Packard Corp. PostScript is a trademark 
of Adobe Systems, Inc. This product incorporates Microsoft Truelmage software with LaserMaster extensions for perform- 
ance. Microsoft is a registered trademark, and Truelmage and Windows are trademarks of Microsoft Corp. 


CIRCLE NO. 563 ON READER SERVICE CARD 
88 Dr. Dobb’s Journal, August 1992 





Power Programming Tools 






UNIX developers can never have too much power. So we 
offer more high performance programming tools and industry 
standard languages for open systems than anyone else. 

We also provide free technical support for registered users, 
as well as a 30-day money back guarantee. 

And all of our LPI languages come bundled with a free 
copy of our CodeWatch debugger. So call us today. And put the 
power of Liant to work for you. 

LPI-FORTRAN is the high performance, ultra-reliable 
FORTRAN-77 compiler providing a number of popular exten- 
sions, such as VAX-FORTRAN, FORTRAN-66, and UNIX access. 
Called "unbreakable" 


by UNIX Today!, our Supported Systems 
LPI-FORTRAN is X/Open : : 
compliant and 88open OCS 1386 SVR3 ) SVR4 
certified. LPI-FORTRAN i486 SVR3, SVR4 
supports the Weitek Abacus RS/ 6000 AIX 
coprocessor family and the SPARC SunOS 
Intel 80287/80387 M88000 SVRS 
COprocessors. SUN-3 SunOS 

LEE POR Ube can fot alammeeetek 
take total advantage of UNIX 


SVR4 including full shared library support and generation of 


LIANT 


Liant logotype, language logos, and CodeWatch are trademarks of Liant Software 

Corporation. UNIX is a registered trademark of AT&T. Copyright © 1992 by Liant 
Software Corporation, 959 Concord St., Framingham, MA 01701. 508-997-5801, 
FAX: 508-992-6936. 


CIRCLE NO. 178 ON READER SERVICE CARD 


for UNIX 














true native ELF object code — the first true FORTRAN imple- 
mentation of this on the i386 and i486. 
LPI-COBOL supports 
ANSI COBOL-85 and 
COBOL-/4, is X/Open 
CAE compliant and | 
gives extensions to comply other COBOL dialects. 
LPI-C** is a true 32-bit C++ compiler, based on a 
complete implementation of the AT&T 2.1 specs. It is compatible 
with cfront, ANSI C and K&R C code. 
LPI-PL/1 is a full implementation of the ANSI PL/I 
X3.74 1981 General Purpose Subset, 880pen OCS 
_ certified and contains DEC and IBM extensions for 
compatibility with existing applications. 











CodeWatch, our windowed source-level 
debugger, allows you to debug using the conventions 
and symbols of each language — including C++ 
overloading support. 

We also offer outstanding compilers for C, 

_ Pascal, and BASIC. 

30-DAY MONEY BACK GUARANTEE. 

| Order now and get Codewatch bundled free 
with your LPI compiler. Call for our product catalog. 

1-800-662-9866 








(continued from page 88) 
corresponds to level 4 of the famous 
OSI seven-level model (see Table 1), 
which concerns itself with the speed, 
quality, and reliability of data transfer 
between systems. This is the correct lay- 
er to attempt to accelerate, since its sole 
concern is moving information at the 
fastest possible rate. (See the textbox 
“TCP/IP In Brief .”) 

Associated with each TCP “session” 
is a pcb, or protocol control block. On 
receipt of a new TCP segment, we need 
to find the associated Internet pcb, or 
inpcb. We must search hundreds of pos- 
sible connections to find the appropri- 
ate buffer; see inpcbQ) in Listing One, 
page 122. This costly search could be 
shortened by assuming bursty behavior, 
checking whether the last TCP segment 
came from the same source, and then 
using the previous search, which is still 
valid; see Listing Two, page 122. Figure 
2 describes the TCP segment format. 

You might be tempted to elaborate 
upon these simple kinds of improve- 
ments or to spread the theme through- 
out the networking implementation. 


Header Prediction: Stretching the Present 
into the Future 

There is a header (or trailer) associated 
with the data of a packet that contains 


NETWORKING 


the control information relevant to the 


protocol processing of this and related 


packets. Usually, this information is pro- 
cessed as it is received. If it is a con- 
nection-oriented or “virtual circuit” pro- 
tocol (as opposed to a “connectionless,” 
or datagram protocol), the state for this 
connection must be updated, and new 
packets might need to be scheduled for 
transfer. (Perhaps either an acknowl- 
edgment or more data needs to be sent.) 
This represents the computational over- 
head of the protocol, and in most cas- 
es it occupies the front end of the pro- 
cess of accepting new data on reception, 
in conjunction with receipt of a pack- 
et. This is normal when we don’t know 
what the next packet might contain, or 
when it will arrive. 

Header prediction is founded on the 
sensible expectation that the next pack- 
et will come along on the heels of the 
previous one, and that the next pack- 
et’s header will likely be “guessable” by 
the software. These assumptions are true 
in cases associated with mass transfers 
in the first place; corrupted or lost pack- 
ets are low probability events, and the 
more elegant features of a given trans- 
port-level protocol are not usually ex- 
ercised by the applications programmer. 
So, when possible, header prediction 
uses a good heuristic to process the pro- 


tocol with a short-circuit evaluation. 

Essentially, the heuristic involves track- 
ing expected state changes associated 
with each communications “stream.” 
Thus, a model of each stream’s next pro- 
tocol header is prepared before the 
packet arrives. When it arrives, the in- 
coming header must simply be matched 
to its intended form. If the match is cor- 
rect, the precomputed “new” state is as- 
signed, the packet’s data is delivered, 
and packet processing is complete. This 
process may take very few instructions, 
thus reducing the critical path. Howev- 
er, if the match is incorrect, all the pro- 
cessing we attempted to avoid must now 
be done, and our speed advantage is 
lost. Fortunately, this occurs less than a 
fraction of a percent of the time, so it’s 
of little consequence. 

Listing Three (page 122) shows the 
code in 386BSD that attempts limited 
header prediction. It searches for two 
common cases: receipt of a TCP seg- 
ment that can be transferred to the ap- 
plication immediately, without further 
processing; and response to a simple 
acknowledgment to the last outstand- 
ing segment transmitted. For these cas- 
es, processing the TCP segment can be 
collapsed down to as little as 40-50 in- 
structions, instead of the several thou- 
sand generally involved. 
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The incoming packet (¢) is compared 
with recorded information about this 
TCP session (¢p) kept in a protocol con- 
trol block. We check to see that it: 


¢ Uses no other features of the proto- 
col (flags). 

e Is a consecutive, in-sequence order 
received packet (ti->ti_seg == tp-> 
rCcv_NXt). 

e Is not the subject of a retransmission 
(tp->snd_nxt == tp->snd_max, that is, 
the outstanding transmission). 


Also, we verify that the session is at 
steady state, with both sides proceed- 
ing in lockstep (ti->ti_win && ti-> 
ti_win==tp->snd_wnd). If so, we can 
process the packet, which is stored in 
a buffer (7m) overlaid by a data struc- 
ture (ti) that exposes the details of the 
packet’s TCP/IP headers. 

The heuristic works best with large 
packets of data that arrive at a consis- 
tent pace, without variation, as occurs 
most with fast transfers. With header 
compression, a modern 100 MIP work- 
station might be able to handle multigi- 
gabit transfers with little hardware assist. 


New Challenges in Very High-Speed 
Networking 

Once we understand more fully the na- 
ture of very high-speed network traf- 
fic, we must find ways (such as new 
protocol processing techniques) to bet- 
ter use the present arrangement of com- 
puter hardware and software. After all, 
people are not going to just throw out 
all their equipment and start from 
scratch. 

Demands for greater bandwidth, 
round-trip time exchanges, and bursty 
networks are just a few of the chal- 
lenges faced by very high-speed net- 
works. But they all illustrate this un- 
derlying problem: As you scale a base 
technology used to build a complex en- 
tity, risky side effects grow as a power 
function, or worse yet, as an exponen- 






Padding _[...Options...] 





Figure 2: TCP segment format. 
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tial! Take, for example, the pragmatic 
redesign of the Internet in response to 
its growth (in terms of number of hosts 
and additional demand of service). Nu- 
merous times, linear scaling has pro- 
voked nonlinear effects, thus necessi- 
tating evolution of the network 
standards and redeployment of the net- 
work structure itself. (You could make 
a fair argument as to why OSI has not 
displaced TCP/IP simply on the basis 
of not keeping up with the required 
changes that have strained the Internet 
in this way.) 








Forward-error Correction: When You 
Care Enough to Not Retransmit 

At still higher data rates than header pre- 
diction can handle, we might need to 
abandon the current paradigms in pro- 
tocols like TCP. One such paradigm is 
the network delay and processing over- 
head of handling retransmission of failed 
data. One simple method to avoid re- 
sending garbled data is to not resend it 
at all! Instead, additional information is 
sent in the form of an error-detection 
and correction-encoding scheme. Thus 
data integrity is preserved without the 
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need for retransmissions. The cost of 


this technique is contained in the addi- 
tional processing of the encoding/de- 
coding and the overhead of transmit- 
ting redundant data-state information. 


Current Segment 
for Transmission 


Redundant 
size m Transmission 
|) FIFO Queue 
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A serpentine, or “barbershop pole” 
algorithm is possible; see Figure 3. Since 
we won't know where the error might 
occur, we must transmit all the data at 
least twice, or transfer ways to recon- 
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Figure 3: Algorithm for forward-error correction. 
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struct it. For Figure 3, we just redun- 
dantly transmit all information, spacing 
the data at the largest time offset allowed 
by the buffers allocated on the sending 
side. This resembles the original re- 
transmission scheme, except that when 
we exceed the memory space of the 
transmitting computer system, we send 
the (delayed) packet. Thus the retrans- 
mission memory is a FIFO queue of 
packets. 

A scheme like this relies on proba- 
bilities: It assumes that the error con- 
dition is transitory (soft) within the FI- 
FO’s lifetime. If it isn’t, you can drop 
all the data in flight (1), abandon the 
connect (chances are it’s dying any- 
way), and return to the original re- 
transmission scheme, taking the per- 
formance hit. 

Forward-error correction addresses 
network delay, but doesn’t deal with in- 
creasing protocol-processing perfor- 
mance. Suppose you had a worksta- 
tion that you wanted to receive different 
real-time video feeds, each to a sepa- 
rate window, all in software. In this case, 
the assumptions that allow header pre- 
diction are not present. Can we cope 
with this even greater demand for net- 
work bandwidth? Yes, we can; protocol 
engines are one solution. 
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Protocol Engines: Specialized Hardware 
to Bypass the Bottlenecks 

At some point, processing the network 
protocols pushes the limits of the aver- 
age workstation too far. We might con- 
sider factoring in increasing amounts of 
dedicated hardware to more lithely han- 
dle the necessary processing. This is 
warping the hardware to fit the situa- 
tion; we can call this a “protocol engine.” 

A protocol engine departs from the 
traditional frame of computer system/ 
network integration by substituting vary- 
ing degrees of special-purpose hard- 
ware to remove bottlenecks that other- 
wise limit performance. In this case, 
both hardware and software are muta- 
ble to the degree that the desired perfor- 
mance objective needs to be achieved. 
If a protocol implementation’s software 
consumes too many hardware instruc- 
tions, we rebuild the hardware—we 
provide new facilities that make the soft- 
ware’s use of hardware fall into line. Ex- 
amples of this mountain-to-Mohammed 
strategy are providing separate data 
paths for the data and header of a pack- 
et or providing a separate functional unit 
to checksum a packet as it is processed. 
We might also wish to overlap routing 
lookup with ordinary packet process- 
ing, killing the processing if a route is 
found that exists to another network 
interface. 

We could conceivably process every 
step of protocol implementation in hard- 
ware, potentially parallelizing every op- 
eration possible. Such an endeavor 
would be extraordinarily expensive— 
and not necessarily justified. As men- 
tioned earlier, high-speed traffic is char- 
acterized by bursty behavior. An ultimate 
solution such as a fully hardware-im- 
plemented protocol stack would be 
used intensively only during those brief 
bursts. Given current expectations, this 
would be expensive overkill. In fact, the 
virtue of protocol engines lies in their 
potential to scale to vaster aggregate 
amounts of bandwidths by means of in- 
creasing application of VLSI hardware. 

Protocol engines are a controversial 
topic; many prominent critics argue that 
they are either underjustified, or a con- 
tinuation of the outmoded front-end 
processor arrangement popular in ear- 
ly networks. Front-end processors would 
entirely process the protocols on a sep- 
arate microprocessor, usually on the 
same board as the network interface. 
This allowed a nonnetworked comput- 
er system to interface to a network by 
adding a device driver and some soft- 
ware “glue.” The value of this approach 
was that it avoided changing the oper- 
ating system to work with the network. 
A vague argument for offloading the 
protocol processing was frequently 
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made. However, these units were usu- 
ally slower than native-mode protocols 
because the main processor was con- 
siderably faster than the front end, and 
the cost of communication between the 
main processor and the front end was 
just as expensive as having the main 
processor run the protocols itself! A true 


It’s important to be 
aware of various 
approaches as the 
world gradually 

migrates into gigabit 
networks. Simple 
and complex 
solutions can be 
found by “thinking 
gigabit” 


protocol engine is a difficult and con- 
siderable achievement, since it must out- 
perform main-processor technology 
without shifting the burden back to the 
main processor. 


Conclusion 

Testbed networks that employ the ele- 
ments described here are currently in 
operation at various technology centers 
around the world. In the U.S. alone, 
interest in the competing projects for 
the National Research and Educational 
Network CNREN), as well as the inter- 
est in fiber-based networks like FDDI 
and HiPPi, have catalyzed this activity. 
ATM and SONET are watchwords for 
scalable network bandwidth to take us 
into the gigabit/sec world. Improve- 
ments like single-element caching and 
header compression are already in 
Berkeley networking implementations 
that you can obtain today (386BSD). In 
short, gigabit networking is far closer 
than you think. 
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EXAMINING ROOM 





xtensions to the C and C++ lan- 
guages take many forms, de- 
pending on what the extenders 
have in mind. The ANSI C com- 
mittee includes the Numerical C 
Extensions Group, whose task it is to de- 
fine numerical extensions to the language 
beyond the ones already built in. Other 
language extensions support particular 
development platforms. For example, 
Borland’s Object Windows Library uses 
an extension to C++ class definition that 
supports the declaration of a message- 
response member function. 
The Borland C++ compiler includes 
a number of other extensions to the C 
language that support DOS systems pro- 
gramming, that branch of programming 
that includes device drivers, memory- 
resident programs, and other low-level 
activities. To illustrate the use of the ex- 
tensions, I'll describe TSRPLUS, a pub- 
lic-domain swapping terminate-and-stay- 
resident (TSR) driver that compiles with 
all versions of Borland and Turbo C. It 
is the resident part of a TSR application 
that swaps the memory of the inter- 
rupted program for the TSR application’s 
image, executes the TSR application, 
and swaps the original program back 
in. The source code includes the TSR- 
PLUS driver and a brief application pro- 
gram to serve as an example. Because 
of its length, the source to TSRPLUS is 
not printed in this issue, but is available 
electronically under the filename TSR- 
PLUS.ARC; see “Availability,” page 3. 
Note that this article is not a treatise 
on portable code. The code that you 
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write with these techniques is com- 
pletely nonportable to other computer 
architectures and other operating sys- 
tems. It is mostly nonportable to other 
compiler products. And in some rare 
cases, it is potentially nonportable to 
past and future versions of Borland C 
(BC). When you do systems program- 
ming this close to the hardware and op- 
erating system, portability is the least of 
your concerns. Similarly, this discussion 
does not address how TSRs work. There 
are a number of good works on this 
subject. An understanding of what al- 
lows a TSR to pop up and what rules it 
must obey is helpful here but not nec- 
essary. Where I discuss those issues, it 
is only to illustrate how I have used the 
language extensions of Borland C++ to 
solve their problems. You can learn 
about TSRs in several of my books and 
in Andrew Schulman’s more recent and 
comprehensive Undocumented DOS 
(Addison-Wesley, 1990). 

Most of the BC systems-programming 
extensions have been in the compiler 
since the first version of Turbo C. They 
provide the programmer with close ac- 
cess to the hardware, BIOS, and oper- 
ating system. The extensions include 
register pseudovariables, inline func- 
tions, the interrupt function type, and 
inline assembly code. With them, a pro- 
grammer can avoid most of the assem- 
bly language functions that systems pro- 
grams normally must call when standard 
C can neither reach the hardware nor 
meet the timing performance require- 
ments of the problem at hand. 

BC and other compilers include func- 
tions that support access to BIOS and 
DOS in their runtime libraries. These 
include such functions as int&S6, bioskey, 





getvect, and so on. There is no standard 
for the types, names, and parameters 
of these functions, but the BC library 
includes versions compatible with their 
earlier compilers as well as versions 
compatible with Microsoft C. Why use 
language extensions instead of the li- 
brary functions? In most cases you can 
achieve the same results by using the 
language extensions, and you will have 
smaller, faster code. Sometimes you 
want to do something for which there 
is no library function. Some library func- 


tions reference other functions or glob- 


al variables that force other object mod- 
ules to be linked as a side effect, if not 
in the latest version of the compiler, 
then perhaps in a future version. This 
can cause the executable code to be 
larger than it needs to be. Using the 
language extensions can bypass such 
side effects. Some library functions de- 
pend on features provided by the start- 
up code, such as stack, heap, and en- 
vironment variable pointers. Later, we'll 
replace the startup code to get the 
smallest possible program, and we will 
not use those things. Calling some li- 
brary functions from this program 
would result in unresolved references 
when you link. 


Register Pseudovariables 

BC’s register pseudovariables are fixed 
variables that directly address the micro- 
processor’s registers. Their names in- 
clude _AX, _BX, _ES, _FLAGS, and so 
on. You can assign an integral value— 
including address segments and off- 
sets—to one of these pseudovariables, 
which puts the value into the corre- 
sponding hardware register. You can 
use a pseudovariable in an expression, 
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(continued from page 94) 
and its contents are treated as if they 
came from an unsigned int. 

When would you want to use regis- 
ter pseudovariables? A common use is 
to send parameters to interrupts and to 
read the results that interrupts return in 
registers. We will do some of that later. 
But you must be careful. The compiler 
assumes not only that you know what 
you are doing but that you know what 
it is doing as well. The compiler itself 
uses registers in many different ways. 
Your use of a register and the compil- 
er’s use of the same register must not 
conflict. In some cases, the compiler is 
smart enough to see that you are using 
registers and it will avoid their use. For 
example, the compiler can use hard- 
ware registers for automatic variables. 
If you use the corresponding register 
pseudovariables, the compiler will de- 
cide not to use them and will put the 


abS* 


MOV. ..e 620 
push ds 


pop es 


axis 
ax,ds 
es,ax 


_DS; 
[23% 





Example 1: (a) This code fragment 
compiles correctly with Turbo C 2.0, 
but not with Borland C++ 3.0; (b) 
code generated by Turbo C 2.0 using 
the -S command-line option; (c) code 
generated by Borland C++ 3.0 using 
the -S command line option; (d) code 
to resolve the problem of the compiler 
assigning registers when there is no 
corresponding mov instruction. 


Ox48; . 


automatic variables on the stack frame 
or in registers that you do not use. 

In other cases, the compiler is not so 
smart. Sometimes it seems to get less so 
with successive versions of the compil- 
er. For example, the code fragment in 
Example 1(a) compiles correctly with 
Turbo C 2.0, but not with Borland C++ 
3.0. To see why, we can use the -S com- 
mand-line option to look at the com- 
piled assembly language code from the 
two compilers. Turbo C 2.0 generates 
the code shown in Example 1(b), and 
Borland C++ 3.0 generates that shown 
in Example 1(c). 

In the TC example, the compiler us- 
es a push and pop to assign DS to ES. 
In the BC example, the compiler moves 
DS to ES through AX, the same register 
to which you just assigned a value. Your 
value is overwritten before you have a 
chance to use it. The older compiler is 
no smarter than the newer one with re- 
spect to which pseudovariables you 
used. It just uses a different technique 
to assign registers where the machine 
language has no corresponding mov in- 
struction. In this case, the newer tech- 
nique generates a conflict between your 
use of the _AX pseudoregister and the 
compiler’s use of the AX register. Ex- 
ample 1(d) shows how you can fix the 
code. Now, both compilers generate 
correct code. That does not mean, how- 
ever, that future versions of the com- 
piler will not find another way to trip 
up your use of register pseudovariables. 
At all times, use register pseudovariables 
only when you know exactly what their 
use will lead to. 


Inline Functions 
BC has several macros that generate in- 
line code. With them you can directly 
access memory, interrupts, and hard- 
ware devices. The compiler generates 
inline code for the machine instructions 
that perform the tasks of the macros. 
The geninterrupt macro takes an in- 
terrupt number as an argument and ex- 
ecutes the corresponding int machine 


// Allocate memory function 


10; // # of paragraphs to allocate 


geninterrupt (0x21) ; 
if ((_FLAGS & 1) == 
segment = _AX; 
else 
LESSEE os 


Pyeatl DOS 
//-test carry ‘bit 
// segment of the allocated block 





Example 2: Allocating a block of DOS memory. 


disable(); 
_SS = oldss; 


_SP = oldsp; 
enable(); 





// interrupts must not occur now 
// otherwise SS and SP will be wrong 


Example 3: The interrupt-enabled condition is controlled by a bit in the FLAGS 


register. 
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instruction. You normally use register 
pseudovariables in conjunction with this 
macro. For instance, you can allocate a 
block of DOS memory with the code in 
Example 2. This is an example of how 
the BC language extensions generate 
code as good as you can write in as- 
sembly language. You might suspect 
that the FLAGS & 1 expression will use 
the AX register, thus interfering with the 
AX assignment that follows, but not so. 
The compiler codes a simple JC Gump 
if carry) opcode for the expression. 

The inport, inportb, outport, and out- 
portb macros read and write hardware 
I/O ports. Their most common use is to 
access the interrupt controller and read 
the keyboard port. If you are using BC 
to write a device driver for a custom 
hardware device, you could make ex- 
tensive use of these macros. 

The enable and disable macros gen- 
erate the sti and cli machine instructions 
to enable and disable interrupts. You 
will use them whenever you need to 
suspend and resume interrupts for any 
reason. Interrupts are normally enabled 
when a program is running. Sometimes 
you will need to disable interrupts so 
that you may do something that cannot 
be interrupted. For example, anytime 
you change the stack segment and 
pointer registers, you should disable in- 
terrupts so that an interrupt does not 
occur while the stack integrity is com- 
promised. In Example 3, the interrupt- 
enabled condition is controlled by a bit 
in the FLAGS register. When an inter- 
rupt occurs, it pushes the FLAGS and 
the CS:IP registers on the stack, disables 
interrupts, and writes the contents of 
the interrupt vector into the CS:IP reg- 
isters. That starts the interrupt service 
routine (SR) running with interrupts dis- 
abled. The iret instruction from the ISR 
pops the flags and the registers, which 
enables interrupts and returns to the 
interrupted location. The result is that 
most ISRs execute with interrupts dis- 
abled. If you are writing an ISR that in- 
volves extensive processing, you will 
need to enable interrupts from within 
the ISR. Pop-up TSR programs are ex- 
amples of programs that execute as the 
result of an interrupt. If you did not en- 
able interrupts before running the TSR 
program, it would not be able to use 
any of the system services. 

There are macros that allow you to 
retrieve and write the contents of mem- 
ory by direct access to the memory ad- 
dress. They are the peek, poke, peekb, 
and pokeb macros. With them you spec- 
ify the segment and offset of the ad- 
dress. The peek and peekb macros re- 
turn the contents of the memory word 
or byte. The poke and pokeb macros ac- 
cept a word- or byte-value parameter 
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(continued from page 96) 

that they write into the memory loca- 
tion. You will use these macros primarily 
to read and write video memory and 
the BIOS data areas. There are other far 
locations that you will need to read and 
write, such as DOS memory blocks, and 
you Will usually use far pointers or the 
movedata function to access them. 

The getvect and setvect functions read 
and write the contents of the specified 
interrupt vectors. You will use these to 
hook your ISR to an interrupt and to 
chain your ISR to the previous holder 
of the interrupt vector. Example 4(a) 
shows the initialization code for your 
program. Your ISR, which now executes 
when the interrupt occurs, will chain to 
the old interrupt the way shown in Ex- 
ample 4(b). It might do the chain first 
before it does its own processing of the 
interrupt; it might do it last; or it might 
do it only if certain conditions are sat- 
isfied. Some ISRs will not chain the in- 
terrupt at all. 

Before your program terminates, it 
must return the original value to the in- 
terrupt vector using the call shown in 
Example 4(c). 


(a) 

void interrupt (*oldISR) (void) ; 
oldISR = getvect (VECTOR) ; 
setvect (VECTOR, newISR) ; 


(b) 
(*o1dISR) (); 


(c) 
setvect(VECTOR, oldISR); 


Example 4: (a) Initialization code; 
(b) chaining to the old interrupt; (c) 
returning the value of the original in- 
terrupt vector. 


(a) 


typedef struct { 





The Interrupt-function Type 


The interrupt-function type tells the 


compiler to compile the function as an 
ISR. An interrupt function assumes that 
it is being called as the result of an inter- 
rupt, perhaps from an unrelated pro- 
cess. It may assume nothing about the 
values of registers. Therefore, upon en- 
try, the interrupt function pushes all the 
registers on the stack, initializes the DS 
register to point to the program’s 
DGROUP segment, and sets up the 
stack frame in the BP register. Upon ex- 
it, the interrupt function pops all regis- 
ters from the stack and executes the iret 
machine instruction to return to the in- 
terrupted location. 

Often an ISR needs to read the val- 
ues of registers to get its parameters and 
needs to return its results in registers. 
Up to a point, you can read the pa- 
rameters with the register pseudovari- 
ables. This works only as long as the 
compiler does not use the same regis- 
ters itself. You cannot return values from 
the ISR by writing to the register pseu- 
dovariables because the interrupt func- 
tion pops the original values into the 
registers just before it returns. 

When the interrupt function pushes 
all the registers, it leaves them on the 
stack just as if the function had been 
called with those registers as parame- 
ters. You can declare the function with 
the IREGS data type, see Example 5(a), 
in its parameter list. The interrupt func- 
tion declaration looks like Example 
5(b). You can read the value of the reg- 
isters upon entry by reading the cor- 
responding members of the structure, 
as in Example 5(c). You can change 
the value of registers that will be re- 
turned to the caller by changing the 
contents of the structure members, as 
in Example 5(d). 


int bp; di sido eswaxyex) bx, Ax, 1p..ce&ard3 


} IREGS; 


(b) : 


void interrupt newISR(IREGS ir) 
{ 


Ree 
} 


(c) 


Tt ugh rake) 
cen ey agen) 


// return 3 in ax 
// return the carry bit on 





Example 5: (a) Declaring the function with the IREGS data type in its 
parameter list; (b) the interrupt-function declaration, (c) reading the value of 
the registers upon entry by reading the corresponding members of the structure; 
(d) changing the value of registers that will be returned to the caller by 
changing the contents of the structure members. 
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When the return sequence pops the 
values back into the registers, these new 
values will go from the structure on the 
stack frame into the machine registers. 

If you are chaining to an old ISR, you 
must make sure that the registers on en- 
try are preserved and that the registers 
on exit are put into the copy of the 
structure on the stack frame. If you do 
any processing before the chain, you 
must reset the real registers from the 
stack frame. Example 6(a) illustrates how 
you do that. If the old ISR returns some 
values in registers that the caller needs 
to receive, you must make similar pro- 
visions upon the old ISR’s exit, as in Ex- 
ample 6(b). 

If the old ISR uses the DS register for 
input, you need an additional step. The 
call to o/dISR is through a pointer in the 
data segment of the ISR. If you change 
the DS register, the call will fail, and the 
program will probably crash. Example 
6(c) shows you what you then must do. 
If your ISR does some processing after 
the chained interrupt service returns, 
you must take steps to preserve the DS 
register, as in Example 6(d). 

There is one more tricky aspect to all 
this. Some ISRs, most notably the VGA 
BIOS, receive parameters and return val- 
ues in the BP and DS registers. You must 
intercept and chain these ISRs with a 
separately compiled assembly language 
program that maintains its function 
pointer in the code segment rather than 
in the data segment or the stack frame. 
You should similarly intercept any 
chained interrupts where there is no 
comprehensive standard for their use. 
The Ox2f interrupt is an example of such 
an interrupt. 

An interrupt function does not have 
to be executed as the result of an inter- 
rupt. You can call an interrupt function 
from within your program. The compil- 
er generates a pushfinstruction followed 
by the far call to the function, emulat- 
ing what happens when an interrupt oc- 
curs. You can use this behavior to ad- 
vantage. The swapping TSR driver stores 
the address of an interrupt function in 
the TSR’s application module. The in- 
terrupt function is the application’s pop- 
up entry point. After swapping the ap- 
plication module into memory, the TSR 
driver will execute it by calling the en- 
try code through the interrupt-function 
pointer. The interrupt function’s entry 
logic prepares the function for execu- 
tion by setting up the registers. 


Inline Assembly 

BC recognizes the asm keyword, which 
tells it that the statement is an assembly 
language instruction. The compiler in- 
serts the instruction into the object code. 
You can use the names of C variables 
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in these instructions, and the compiler 
will generate the proper references. You 
may not use the asm keyword to de- 
clare a variable or access things like 
DGROUP. As with register pseudovari- 
ables, you must know what you are do- 
ing when you use inline assembly code. 


A Swapping TSR Driver 

Now we will put these techniques to 
use. The accompanying program is an 
example that uses the principles we 
have been discussing. It is a swapping 
TSR driver program called TSRPLUS. I 
originally developed its concept in 1989 
and published it in a book called Ex- 
tending Turbo C Professional. Since then, 
I have modified the program several 
times and used it in many programs. 
There are several versions of TSRPLUS, 
each of which handles the swapping 
problem differently. This version installs 
a 5K TSR driver program that swaps in 
a much bigger pop-up application when 
the user presses the hot key. 

A swapping TSR consists of two 
pieces: the permanently resident TSR 
driver and the transient pop-up appli- 
cation image. The resident part watch- 
es the keyboard interrupt for the hot 
key and manages memory swapping. 


(a) 


void interrupt newISR(IREGS ir) 
{ 


The transient part does the job of the 
pop-up application. The swap file can 
be any mass storage medium. If you can 
use EMS, XMS, or a RAM disk, the swap- 
ping operation is faster. If you use a disk 
file, the swapping operation will take 
longer, depending on the sizes of the 
pop-up application image and inter- 
rupted program. 

The pop-up application image and 
the swapped-out interrupted program 
include copies of the interrupt vectors. 
This is because an interrupt vector might 
have been hooked by and point into 
the program that you are going to tem- 
porarily replace. If you do not restore 
the interrupt vectors to a condition com- 
patible with the pop-up program and 
the hooked interrupt occurs, the system 
will crash. 


The Pop-up Application 

The pop-up application program is a 
normal DOS program that does not use 
any of the DOS functions from 0 to 12 
and does not spawn other programs by 
calling DOS. It links with a module that 
allows it to register itself with the TSR 
driver program as a pop-up program. 
The module supplies the program’s 
main function. The program must pro- 
vide three functions: one for any ini- 


// do some processing that might change registers 


SAX Se ax 
She =a hx 


// restore the registers that the old ISR needs 


(*oldISR)(); // chain to the old isr 


} 
(b) 


void interrupt newISR(IREGS ir) 
it 


(*o1dISR) (); 

ir.ax = _AX; 

ir.fl = _FLAGS; 
} 


(c) 
void interrupt newISR(IREGS ir) 
{ 

void interrupt (*tmpISR) (); 


// chain to the old isr_ 
// caller will get ax 
// and the flags 


tmpISR = oldISR; // use function pointer on the stack 


abs. = trtde: 
(*tmpISR) (); 


(d) 
void interrupt newISR(IREGS ir) 
{ 

void interrupt (*tmpISR) (); 


// reset DS 
// chain to the old isr through tmp ptr 


int oldds = _DS; // save ISR's DS 
tmpISR = oldISR; // use function pointer on the stack 


D0S->-trsdae 
(*tmpISR) (); 
_DS = oldds; 
ify zat 


// reset DS 
// chain to the old isr through tmp ptr 
// reset DS 
do further processing 





Example 6: (a) Resetting the real registers from the stack frame; (b) making 
similar provisions upon the old ISR’s exit; (c) handling the case when oldISR 
uses the DS register; preserving the register. 
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tialization code that the program needs 
and that calls the register function; one 
to be executed upon pop-up from the 
hot key; and one to be executed if the 
user runs the pop-up program from the 
command line without the TSR driver 
being resident. 

Other than those differences, the pop- 
up application program looks like any 
other DOS program. It is an .EXE file 
that will execute as a command-line pro- 
gram or that can be a pop-up. 

You load the swapping TSR program 
into memory by running the TSR driv- 
er program. It sets itself up to be a TSR 
and then calls DOS to execute the pop- 
up program. DOS loads the pop-up pro- 
gram into memory just above the TSR 
driver. The pop-up program does its ini- 
tialization and then calls into the TSR 
driver to register itself. The registration 
includes a far pointer to the applica- 
tion’s pop-up entry address. The TSR 
driver writes the pop-up program’s im- 
age and the current interrupt vectors to 
the swap file. The driver returns to the 
pop-up program, which exits, returning 
to the TSR driver. The TSR driver ter- 
minates, declaring itself resident. 

When the user presses the hot key, 
the TSR driver handles all the tests and 
context switching necessary to pop up 
a TSR. Then it swaps the interrupted 
program from memory to the swap de- 
vice, reads the pop-up application im- 
age into memory, and calls the pop-up 
application to execute. When the pop- 
up application returns to the TSR driv- 
er, the driver reads the swapped image 
of the interrupted program back into 
memory and returns to the interrupted 
location. 


Swapping 

There are two ways to swap memory. 
One way swaps just as much memo- 
ry as the pop-up program needs. This 
method leaves the DOS memory con- 
trol block chain in disarray as long as 
the pop-up program is running. The 
pop-up program cannot allocate any 
DOS memory when you use this tech- 
nique. The other way swaps the en- 
tire memory-control block chain-out. 
If you are at the DOS command line 
when you press the hot key, very lit- 
tle memory swaps out. If you are run- 
ning a large program, a lot of memo- 
ry swaps out. The pop-up program 
loads up to and including its termi- 
nating MCB. It can, therefore, allocate 
and deallocate DOS memory while it 
is resident. The example program us- 
es this strategy. 

When the pop-up program first runs, 
it tests to see if the TSR driver is resi- 
dent. If so, the pop-up program regis- 
ters itself in the manner just described. 
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(continued from page 100) 

If not, it can run as a DOS command- 
line program. To the user, the only dif- 
ference is the way that the program is 
executed. This approach allows the 
same .EXE file to work in the TSR en- 
vironment, from the command line, or 
in a window of a multitasking environ- 
ment such as Desqview or Windows. 


Memory Organization 

The memory organization of a C pro- 
gram and the memory requirements of 
a TSR are incompatible. The typical C 
program consists of all the code, fol- 
lowed by all the data, followed by the 
heap and stack. A TSR contains initial- 
ization code and resident code. Ideally, 


'data' 


segment para public 
ends 
segment word public 
ends 
segment byte public 


"bss! 


'bss' 
ends 

segment stack 'stack' 

ends 

segment byte public 'code' 
ends 





Example 7: The code at the front of 
the module will cause the linker to ar- 
range the segments. 


Ter..exe .: 


you would release the memory occu- 
pied by the initialization code to DOS 
when the TSR issues the terminate-and- 
stay-resident function call. TSRs written 
in assembly language can do that with 
little trouble. To get the same effect in 
C requires some manipulation of the 
startup code and the order in which 
things link. 

Borland C’s startup code does a lot 
of things for you. It sets up the heap, 
the stack, the global variables, the di- 
vide-by-zero handler, the pointer to the 
environment variables, the arguments 
to main, the file-handle table, and the 
external uninitialized data space. After 
everything is set up, the startup code 
calls your main function. When the main 
function returns, the startup code cleans 
every thing up, tests for null-pointer as- 
signments, and returns to DOS. In the 
process of doing all this, the startup 
code declares some public variables and 
procedures and refers to some others 
from the runtime library. 

You get the source code to the start- 
up code with the compiler. It is in a file 
named CO.ASM. Most of what it does is 
not necessary for the TSR-driver pro- 
gram. The TSR driver does not use a 
heap, parses its own command-line ar- 


tsr.obj emm.obj xms.obj int2f.obj tsrinit.1lib 


tlink /m /s int2f emm xms tsr,tsr.exe,tsr,$(CLIB) tsrinit 


Ser ka og cP Ld eg 


cOt.obj tsrinit.obj emminit.obj xmsinit.obj init.obj 
tlib tsrinit +tsrinit.obj +c@®t.obj temminit.obj +xmsinit.obj +init.obj 





Example 8: Using the makefile to separate the initialization code from the 
resident code and get the runtime-library code loaded between the two parts. 


(a) 
if (_CS > OxaQOO) { 


dispstr("\a\r\nCannot loadhigh") ; 


return; 


} 
(b) 


EXTERN struct vectors { 
int Vino? 


void (interrupt **oldvect) (void) ; 
void (interrupt *newvect) (void) ; 


} vectors[] = { 

{TIMER, 
{INT28, 
{KYBRD, 
{DISK, 
{VIDEO, 
{TSRPLUSINT, 
(2, 


&oldtimer, 
&o01d28, 
&oldkb, 
&olddisk, 
&oldvideo, 
&o01d2f, 
NULL, 


#define newvectors(vecs) 


register struct vectors *vc 
while (vce->vno) { 


*(ve->oldvect) = getvect(vc->vno) ; 
setvect(vc->vno, vc->newvect) ; 


vert, 


Example 9: (a) Testing to see if the program is loaded high; (b) hooking and 


t+) 


())newtimer}, 
(*) () )new28}, 
(*) ())newkb}, 
(*) ())newdisk}, 
()) 
()) 


(void 
(void 
(void 
(void 


interrupt 
interrupt 
interrupt 
interrupt 
(void interrupt 
(void interrupt 
NULL}}; 


(*) ())newvideo}, 
(*) new2f}, 
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chaining several interrupts with the table and macro statements. 


102 





guments, and finds its own environment 
variables. Therefore, the TSR driver us- 
es a highly modified version of the start- 
up code, COT.ASM. The modified start- 
up code declares the program’s starting 
address, sets up the segment registers 
and the global _ psp variable, sets the 
external uninitialized data space to Os, 
and calls the main function. That’s all it 
does, because that is all the TSR driver 
needs in the way of startup code. 

The TSR executes its initialization 
code and retains only the resident part 
when it becomes a resident program. 
This is not so easy because the com- 
piler and linker organize all the code 
ahead of the data. To separate the ini- 
tialization code from the resident code 
and retain the data segment for the res- 
ident program, you must first change 
the order in which the linker builds the 
segments. You need the stack and da- 
ta to come ahead of the code. That way 
you can truncate the initialization code 
without losing any of the data space. 

The linker determines the order of 
segments based on the order of their 
declaration in the first object file it en- 
counters. Remember that we changed 
the startup code. That code will not be 
the first module seen by the linker be- 
cause the startup code is toward the end 
of the code segment so that its memo- 
ry is returned to DOS. 

There are two other assembly lan- 
guage modules in the TSR-driver pro- 
eram. One of them, INT2F.ASM, con- 
tains the assembly code for the video 
and Ox2f ISRs. This module must be the 
first one that the linker sees. The code 
in Example 7 at the front of the mod- 
ule will cause the linker to arrange the 
segments the way we want them. 

The next problem we run into is that 
the linker will link all our code followed 
by the functions from the runtime li- 
brary into the code segment. We need 
to separate the initialization code from 
the resident code and get the runtime- 
library code loaded between the two 
parts. We will handle that operation in 
the makefile, as shown in Example 8. 

The makefile says that the TSR.EXE 
program depends on the object files that 
constitute the resident part of the TSR 
driver program and the tsrinit.lib library 
file. The TSRINIT.LIB file contains the 
object files for the TSR driver’s initial- 
ization code. The TSRINIT.OB] file must 
be first, and the INIT.OBJ file must be 
last in this library. These files, besides 
anything else they might contain, have 
the addresses of the beginning and end 
of the initialization code. 

The tlink command links the resident 
code first, then searches the C runtime 
library, and finally searches the TSRINIT 
library. This sequence arranges the 
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(continued from page 102) 
source modules in the code segment 
the way we want them. 


Operation 

You run the TSR driver program from 
the command line and it loads the swap- 
ping pop-up application. The TSR driv- 
er has several command line switches 
that modify its behavior. These are: -x, 
don’t use XMS for the swap file; -e, don’t 
use EMS for the swap file; and 
+p<path>, the DOS path for the disk 
swap file. 

If you use -x and -e and do not spec- 
ify a path, the TSR driver will write the 
swap file to the subdirectory specified 
by the TEMP environment variable. 

Because of the way the TSR driver 
program and the pop-up application 
interact with memory and the DOS 
memory-control block, you cannot load 
the program into high memory. If you 


did, the system would probably crash. 
Rather than allow that to happen, the 
TSR driver program tests to see if it is 
loaded high, as in Example 9(a). The 
TSR driver program hooks and chains 
several interrupts with the table and 
macro statements in Example 9(b). This 
example shows how the C preproces- 
sor can emulate the C++ inline function 
to a limited extent. The program calls 
the newvectors macro and passes the 
address of the table using newvec- 
tors(vectors);. 

There is an equivalent oldvectors 
macro that restores interrupt vectors 
from the table. The program uses an- 
other table to hook and restore the crit- 
ical interrupt, break, and ctrl+c interrupt 
vectors. 


Registering the Application 
The TSR driver program executes the 
pop-up application program so that the 


compute program size 
highmemory = _CS + ((unsigned)&codeend / 16); 
sizeprogram = highmemory - _psp; 


adjust MCB for TSRPLUS 


-PSP; 

sizeprogram; 
_AX = 0x4aQQ; 
geninterrupt (DOS) ; 





Example 10: The TSR driver program reduces its own size by calling the DOS 
Ox4a function to changes the size of the PSP’s memory-control block. 


(a) 


sizeprogram = _CSt+((unsigned) main >> 4)-_psp; 
if ((unsigned) main % 4) 


Sizeprogramt+t+; 


(b) 


_DX = sizeprogram; 
_AX = 0x3100; 
geninterrupt (DOS) ; 





Example 11: (a) The TSR driver program computes a new size for itself as the 
distance from the PSP address to the main function; (b) with the size of the 



















{ 


if (hotkeyhit) { 


return; 


} 
} 
(*oldkb) () ; 


resident portion computed, the TSR can terminate and declare itself resident. 


a keyboard ISR-sse.=> #/ 


static void interrupt newkb(void) 


unsigned char kbval = inportb(@x69) ; 
if (!thotkeyhit && !running) { 
if (Keymask && (peekb(@, @x417) & Oxf) == Keymask) 
if (Scancode == @ |; Scancode == kbval) 
hotkeyhit = TRUE; 


/* --- reset the keyboard ---- */ 
kbval = inportb(0x61) ; 
outportb(@x61, kbval | 0x8); 
outportb(@x61, kbval); 
outportb(Ox20, 9x20); 


Example 12: After the TSR driver is resident, its keyboard ISR watches the 
keyboard-input port and the word in the BIOS data space that stores the current 


shift key’s state. 


104 


pop-up can register itself as a swapped 
TSR. First the TSR driver program re- 
duces its own size by calling the DOS 
Ox4a function to change the size of the 
PSP’s memory-control block, as shown 
in Example 10. This step is necessary 
because DOS assigns to a program’s PSP 
all available memory, leaving no room 
to run the pop-up application. In a nor- 
mal C program, the startup code takes 
care of this reduction, using the sizes of 
the stack and heap to determine the size 
of the program. The TSR driver’s start- 
up code does not, however, so the driv- 
er does it here. 

The program computes its new size 
by adding the paragraph address of the 
codeend \abel—the last entry in-the 
code segment—to the value in the 
code-segment register. That value is the 
segment address of the top of the pro- 
gram. By subtracting the address of the 
PSP from the high address, the program 
computes the minimum paragraph size 
in which it can execute. By telling DOS 
to reduce itself to that size, the program 
assures that the next program run will 
load as close to it as possible. 

Next, the TSR-driver program uses the 
DOS 0x4b function to execute the pop- 
up application program. The pop-up 
application program was linked with 
the tsrbuild.c code, which calls the TSR- 
driver program with a Ox2f interrupt to 
register itself. It passes the address of a 
structure that contains its entry point 
and PSP address. The TSR-driver pro- 
gram records this information and 
makes a copy of the pop-up program’s 
interrupt vectors and program image on 
the swap file. It returns to the pop-up 
application which terminates, returning 
control to the TSR-driver program. 


Terminating and Staying Resident 

Now the TSR-driver program computes 
a new size for itself a second time. This 
size will truncate the initialization code 
when the TSR driver becomes resident. 
The new size is computed as the dis- 
tance from the PSP address to the main 
function, as shown in Example 11(a). 
The main function is the first function 
in the initialization code and is, there- 
fore, at the address of the top of the res- 
ident code. With the size of the resident 
portion computed, the TSR can termi- 
nate and declare itself resident, as 
shown in Example 11(b). 

Now that the TSR driver resides in 
memory and an executable image of the 
pop-up application program is stored 
in the swap file, the system can return 
to the DOS command line and run oth- 
er programs. When the user presses the 
hot key, the program can pop up. Af- 
ter the TSR driver is resident, its key- 
board ISR watches the keyboard-input 
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Example 13: Save the interrupted 
stack location and change it to the 
TSR driver's stack. 


(continued from page 104) 

port and the word in the BIOS data 
space that stores the current shift key’s 
state, like Example 12. This interrupt 
function uses several of the Borland C 
extensions. It reads the value of the key- 
board-input port by calling the importb 
macro. It reads the BIOS data area’s 
shift-state mask by calling the peekb 
macro. It resets the keyboard hardware 
and the interrupt controller by calling 
the outportb macro. And it chains to the 
old keyboard ISR by calling it through 
an interrupt-function pointer. This func- 
tion does not pop the application pro- 
gram up. It simply sets a flag that says 
the user pressed the hot key. The 
hooked timer and 0x28 ISRs pop up the 
program when the hot-key indicator is 
set and DOS is in a safe and stable con- 
dition for a pop-up. 

After it is safe to pop up, the TSR- 
driver program switches the DOS con- 
text from the interrupted program to it- 
self. First it changes the stack. The 
interrupted program might not have a 
deep enough stack, and the swap will 
probably write over that space anyway. 
Changing the stack is simple. The ini- 
tialization code saved the stack’s seg- 
ment and pointer register values before 
the TSR-driver program became resi- 
dent. The code in Example 13 saves the 
interrupted stack location and changes 
it to the TSR driver's stack. It is not nec- 
essary to disable interrupts to protect 
the stack’s integrity. Interrupts are cur- 
rently disabled because the program is 
operating from within an ISR. The TSR 
driver saves and resets the DTA, PSP, 
and Ctrl-Break setting. Then it starts the 
swap sequence. 

To swap the interrupted program and 
the pop-up application program, the 
TSR driver uses this sequence: It writes 
the interrupted program’s memory to 
the swap file. Then it writes the system- 
interrupt vector table to the swap file. 
Next it reads the pop-up application 
program’s interrupt vector table. Final- 
ly it reads the pop-up application pro- 
gram’s program memory. This sequence 
is critical. The swapping input/output 
operations could enable interrupts. If 
the new code is in memory along with 
the old interrupt vectors, an unexpect- 
ed interrupt could jump to the wrong 
place. 

After the pop-up application program 
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is swapped into memory, the TSR-driv- 
er program executes it by calling 
through the far interrupt pointer that 
holds the address of the application’s 
entry location. That code is in the source 
file tsrbuild.c that the application links 
with. It does a similar context switch 
between the stacks, DTAs, and PSPs of 
itself and the TSR-driver program, and 
then it executes its application. When 
the application returns, the program 
switches the context back and returns 
to the TSR-driver program. 

When the pop-up application returns 
to the TSR-driver program, the driver 
swaps the interrupted program back in, 
swapping the code first and then the in- 
terrupt vectors. Once again, this se- 
quence is critical. The code for the in- 
terrupted program’s ISRs must be in 
place before any interrupt vectors point 
to it. Next, the TSR driver switches the 
context of the PSP, DTA, and the stack, 
and returns to the interrupted program. 


Unloading the TSR 

The pop-up application program de- 
cides when to unload itself. It calls the 
TSR driver through the Ox2f interrupt 
with a function code that says the driv- 
er should unload itself. The driver sets 
a flag. When the pop-up application re- 
turns to pop down and after the driver 
has swapped the interrupted program 
back in, the driver tests to see if it can 
unload itself. That test makes sure that 
all the interrupt vectors are the same as 
they were when the TSR driver declared 
itself resident, and that there is no pro- 
eram loaded above the TSR driver in 
memory. In other words, the TSR-driv- 
er program can unload only when the 
TSR was popped up from the DOS com- 
mand line and only when no other TSR 
programs are loaded above it. The test 
uses the peek and peekb macros to walk 
the DOS memory-control block chain 
and uses gefvect to compare the con- 
tents of the interrupt vectors with their 
earlier values. 


When Assembly is Needed 
Sometimes you cannot get by without 
some assembly language. We've already 
discussed the startup code. There are 
two other assembly language modules 
that the TSR driver uses. You saw how 
the INT2F module controls the place- 
ment of segments. It also provides the 
ISRs for the 0x10 and Ox2f interrupts. 
These interrupts use registers in ways 
incompatible with the interrupt func- 
tion. The assembly language ISRs chain 
to the old ISRs without disturbing reg- 
ister integrity. 

The INIT module provides the _code- 
end variable, and it provides a function 
that copies the chained 0x10 and Ox2f 





interrupt vector contents into interrupt- 
function pointers in the code segment’s 
address space. This allows the TSR driv- 
er’s assembly language ISRs to chain 
without needing data-segment refer- 
ences to the pointers. 


Conclusion 

You debug the pop-up application pro- 
gram as a DOS command-line program 
by using the source-level debugger. I 
debugged the TSR driver by using Tur- 
bo Debugger with the load and execute 
of the overlaid pop-up application 
stubbed out. I debugged those parts in 
the old way—by displaying messages 
on the screen at critical places in the 
program. It was slow, but sometimes 
that’s the best or only way. 

Finally, the TSRPLUS source code in- 
cludes an example pop-up application 
program. It is a simple D-Flat applica- 
tion, which means that it uses the user- 
interface library that I have been pub- 
lishing in Dr. Dobb’s Journal for the past 
year. Its purpose is to show you how 
to use the TSRPLUS driver. It doesn’t do 
anything other than pop up and down 
and let you remove it from memory 
with a menu selection. Link it with the 
D-Flat library, version 12 or greater. 


DDJ 


Vote for your favorite feature/article. 
Circle Reader Service No. 8. 


C' Language 


#87 


: How can I call Fortran (Basic, Pascal, 
ADA, Lisp) functions from C? 


: The answer is entirely dependent on 
the machine and the specific calling 
sequences of the various compilers in 
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: Does anyone know of a program for 
converting Pascal (Fortran, Lisp, “Old” 
Cy iad tO Ut 


Several public-domain programs are 
available, namely ptoc, p2c, and f2c. 


#89 


: Where can I get copies of all these 
public-domain programs? 


See the regular postings in the 
comp.sources.unix and 
comp.sources.misc newsgroups for 
information. 


Copyright © 1992 Steve Summit 
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File Query View Applications Vocabulary Option Help 





List all orders placed in Jan 92 by customers who live in Arizona. 
Who are our New York customers? 

Give me Pat Blake's address! 

How many items have a shipping weight of less than seven? 
Count order dates grouped by customer. 

List any toner cartridge orders in Dec 91. 

Who bought paper? .~q—___ Execute this sentence 


to get this report ___e 
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pick-and-choose menus, making report specifications a _ 
snap. The VAR package allows developers to define _ 
application specifics, such as data base and vocabulary —T 
relationships. Enhance your product by giving yourend 
users the ability to query in English, QBE or SQL. _ 


« Natural Query VAR $395.00 
« Natural Query End User $169.00 


FairCom SQL Servers® 


These multi-threaded database servers are designed for 
developers who demand control. FairCom's implementation 
offers a seldom-found solution for developers. While one may 
mandate an SQL Server, another's real-time demands may not file management needs YY 
tolerate the overhead and additional resources associated with __ ¢-tree Plus has established 
SQL data access. Our unique servers offer the full range of commercial development. 
data accessibility: low-level; ISAM-level; SQL-level. advantage of the high-leve 


: 7 ae random or sequential acce 
FairCom's innovative implementation of transaction processing  . 


lies at the heart of its server technology, providing complete c-tree Plus is distributed in co 

data integrity. Recovery of all committed transactions aftera especially known for its port 
failure is automatic. Other features include: intermediate policy. Whether for single 
transaction save points; full commit and rollback; roll forward application development, 
from clean backup; dynamic dump capability; netbios; shared processing is included in 
memory; TCP/IP; more throughput; minimized network of features. _ 
traffic; batch operations; high-level ISAM index searches =. 
offloaded to server; super files; key level index locking; disk 
caching; large file capacities (4 GB). | | 












Are you considering Oracle, Sybase, Novell, Gupta, Microsoft or any 
other SQL server? Compare FairCom's value: 
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$295 $495 $ 995 
$495 $995 $1,595 


Unlimited 
$445 $745 $1,495 
$745 $1,495 $2,395 


other platforms available 


Users: 
DOS/WINDOWS: 
Intel Unix-OS/2: 





































The typical Japanese day y But recently, more people 
starts off with a bowl of we: — been enjoying coffee 
: toast 


A Message rice and miso soup eaten 
with chopsticks. 
from 


Dear 
Programmers 
P ublishers: on Wid We always take our 


shoes off inside the 
house, you know! 


Lots of people take the trains "Good morning" in Japanese 
and subways to work in the if sounds like 'Ohio." 


Comparable in popularity 
to the IBM PC in the U.S., 
the NEC PC-9801 series 
are the most widely used 
computers here in Japan. 


The PC-9801 computers 
have the same Intel 8086 
series CPUs as IBM 
computers and run 
MS-DOS, but most IBM PC 
applications cannot be 
executed mainly because 
of BIOS differences. 


However, MICRO DATA, INC. There is a tremendous number No doubt, these applications 
has recently produced emulator of high quality IBM PC appeal to the Japanese market! 
software called 'COLOBOCKLU.’ | | applications for many fields. 


COLOBOCKLU enables the 
PC-9801 series computers 

to run many DOS applications 
directly without any changes. 















1.Sell Unconverted Software. 
meen, From the point of Japanese 
ora = Style and preference to 

man USer interfaces and message 
oe w titles, many Japanese are 

hesitant to use these applications. 
This method penetrates part of the 
Japanese software market. However, 
it may have potential for Japanese 
who know English well. 
COLOBOCKLU by MICRO DATA provides 
support for this part of the market. 
Many famous programs, and also PDS, 
run well with our software emulator 
with almost no changes required. 


2. Sell Converted Software. 


This method works well for 
software with a large 
market potential, but it 
requires a lot of effort 
and labor. For example; 

- Consideration of Japanese preference 
in user interface style. 

-Highly technical knowledge of 
conversion for the PC-9801, including 
the Japanese Front End Processor. 

- Manual translation into the Japanese 


language. 
Several Software applications 


successfully used this conversion 
method. Some examples are: 
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Lotus 1-2-3 (Lotus Development) 
Wordstar (Wordstar International) 
BRIEF (Solution Systems) 
dBASE (Borland International) 
Turbo C (Borland International) 
MS-Multiplan (Microsoft) 

MS-Works (Microsoft) 

MS-Word (Microsoft) _..... etc. 








All brand or product names mentioned are trademarks or 
registered trademarks of their respective holders. 





Are you interested in 
joining our prajects? 
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MICRO DATA, INC. 
based in Tokyo, is 
* > a leading software 
company in Japan. Our main 
products are quality software. 
For the past two years we 
have received the Golden 
Award from Nikkei Business 
Publications for outstanding 
software. Our file management 
program, "ECOLOGY II", was 
rated #2 for Best Evaluation 
#4 for Total 
Ranking, among all PC 
software in Japan by 240,000 
readers. 
For the last several years, 
MICRO DATA has been 
converting IBMPC software 
products, and _ successfully 
marketing them in Japan. We 
converted a disk optimizing 
program for Mark Elfield and 
Associates, Inc. from the U.S., 
and marketed it as 
"NOSTRADAMUS" in Japan 
with good results. We have 
also converted software and 
worked with many other 
companies as well. 
MICRO DATA wants to continue 
providing quality IBMPC 
software customized for the 


USA MICRO DATA, INC., 
VOICE: (415)206-0100 

(leave a message) 

: (415)206-0101 


or directly FAX, 
MICRO DATA, INC., 
Headquarters Tokyo, Japan 
FAX: 011-813-3208-5601 


FAX 


g 


Re. eee fl & XN _, 


Fumio Yamanaka 
International Director 


=o 
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ilicon Graphics has implemented 

parallel extensions to the C com- 

piler for the 4D Power Series 

computers that enable both SIMD 

(loop) and MIMD (independent 
block) parallelization. Code containing 
these extensions can be ported into se- 
rial C environments, recompiled, and 
executed without change. In direct con- 
trast to older methods of parallelization 
that generally involved heavy use of 
platform-specific system calls combined 
with code restructuring, parallelized 
code using IRIS Power C retains its se- 
rial structure. In this article, I will intro- 
duce you to the basics of parallelizing 
C code with IRIS Power C, discuss da- 
ta dependence as an important limita- 
tion, and end with two program exam- 
ples to convince you that the phrase 
“portable parallel C” is not an oxymoron. 


Parallelization 

I'll start this discussion by describing in 
general terms how SGI does parallelism 
on a shared-memory multiprocessor 
workstation. A parallelized program is 
a serial program that contains regions 
of parallel execution. On entering a par- 
allel region, the serial program is trans- 
formed from a single process to multi- 
ple processes, each running one of the 
parallel-execution threads. These regions 
and their parallel-execution behavior are 
determined by #pragma directives in- 
cluded in the code and interpreted by 


Barr uses high-performance compu- 
ters to design pharmaceuticals for 
Schering-Plough Research Institute. 
He can be reached at 60 Orange 
Street B1-3-85, Bloomfield, NJ 07005. 
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Parallel C 
Extensions 


Parallelized code retains ts serial 


structure 


Barr £. Bauer 


the compiler with the -mp option; oth- 
erwise they are ignored. 

At the system level, changes have 
been made to UNIX to support process- 
based parallelism on one hand and mul- 
tiple processors on the other, while 
maintaining system generality. A paral- 
lel program’s processes (threads) are al- 
lowed access to shared memory. They 
maintain data structures that transpar- 
ently coordinate interprocess data ac- 
cess and process synchronization. Oth- 
erwise, they behave like other processes 
in the scheduler’s queue. Parallel exe- 
cution itself is done in sched, the UNIX 
preemptive-multitasking scheduler, 
which has been modified to distribute 
processes over multiple processors 
rather than the usual one. A parallelized 
program’s threads timeshare with other 
tasks run by the system, either parallel 
or serial, resulting in a flexible execu- 
tion environment. On an eight-proces- 
sor system, this environment can exe- 
cute eight serial programs in the time 
of one, a single parallelized program up 
to eight times faster, or a mix of serial 
and parallel programs significantly faster. 


Parallel Models 

SGI implements the SIMD and MIMD 
parallel models. SIMD is the easiest to 
understand and implement. In a for loop 
that processes members of an array, all 
threads execute the same code on dif- 
ferent array elements. The individual it- 
erations of the loop are divided between 
the available threads and executed on 
them. For instance, if four threads are 
available, a 1000-iteration loop is par- 
allelized by executing iterations 0-249 
by thread 0, 250-499 by thread 1, and 


so forth. If each iteration has the same 
execution time, a four-thread system can 
be close to four times faster than serial 
execution. IRIS Power C currently lim- 
its loop parallelization to the for-loop 
construct because it is the only loop 
construct within C which lets you de- 
fine an index, a maximum number of 
iterations, and an increment. 

MIMD is implemented as indepen- 
dent-block parallelism. A block of code 
within a function can be executed on 
its own thread, independent of other 
blocks. In contrast to loop paralleliza- 
tion, the code in each block may oper- 
ate differently on global and local (pri- 
vate) data. Program speedups are less 
easily determined, being highly depen- 
dent on the nature of the code. Imple- 
mentation of MIMD parallelization is 
limited by the lack of a suitable pro- 
gramming landmark, such as a loop, to 
attract attention. 

The two models can be mixed and 
matched in a program either in sepa- 
rate parallel regions or even within a 
single parallel region. Two threads might 
execute independent blocks of code us- 
ing the MIMD model, while six threads 
execute a for loop using the SIMD mod- 
el. The limiting factor seems to be keep- 
ing the program structure straight. To 
avoid confusion, I use one model per 
parallel region. 


Parallel Data Types and Data Dependence 
Variables within code executed in par- 
allel are either shared between threads 
or local to a specific thread. Shared 
means just that: Each execution thread 
points to a single location in shared 
memory associated with that variable. 
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> The industry standard for 
text mode user interfaces 
Since 1989, thousands of programmers have 
chosen MEWEL as their text mode user 
interface. It’s the only window library which 
fully supports the SAA/CUA look and feel 
and gives unparalleled portability to 
Microsoft Windows. Companies like Fifth 
Generation Systems, Lotus, Solution Systems, 
American Airlines, Novell, and Intel have 
used MEWEL in successful products and 
applications. 





& SAA/CUA Compatibility 
MEWEL supports all of the window types 
you find in Microsoft Windows and SAA/ 
CUA. 


Pushbuttons(shadowed, multiline), 
checkboxes, radio buttons, listboxes 
(scrollable, multi select, multi column, owner 
drawn), scrollbars (horizontal, vertical, 
control), edit (single and multiline, 
wordwrap), static (text, frame, box, icons), 
multi-level menus, full mouse support, 
dialog boxes (modal, modeless), message 
boxes, and combo boxes (all 3 types). 


No other windowing library supports all of 
these controls. Perform subclassing to create 
your own controls! The Multiple Document 
Interface (MDI) is supported too! 


r Interface and 
Instant Portability | 


to Windows 





MEWEL supports tiled and overlapping 
windows. Complex clipping between 
windows. Support for MDA, CGA, EGA, 
VGA, Super VGA, and custom video modes. 


,Windows | 
Compatible API 

The MEWEL API is compatible with the 
Microsoft Windows API, the industry 
standard. Port between MEWEL and 
Windows just by recompiling! Compatible at 
the function, message, data structure, and 
resource level. Hundreds of companies use 
MEWEL to go between text mode and 
Windows. 


» 3rd Party Support 

Since MEWEL uses the Windows API, you 
can use sophisticated prototyping tools like 
Case:W, WindowsMaker, and Resource 
Workshop to generate both your Windows 
and your text mode applications. Can easily 
integrate with database libraries like DB Vista 
and Paradox. _ 











About2 


a 


F £ Color ———— 
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| | @ Cyan 

© flea 
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P Multi-Platform 
The same MEWEL source code can run on 
DOS, DOS extenders, OS/2, UNIX and VMS. 
Imagine your Windows code 
working on all these environ- 
ments! 





Magma Systems 
15 Bodwell Terrace, Millburn, NJ 07041 
voice 


(201) 912-0192 
(201) 912-0668 
(201) 912-0103 


BBS 
fax 


(lots of demos) 





MEWEL 3.4 for DOS 
with full source code 
libraries only $295 
UNIX, OS/2, VMS call for pricing 

Please add $5 for shipping, $25 outside US. 

COD, VISA, and MasterCard accepted 


$595 





Microsoft, Borland, Zortech, 


Supports: 


TopSpeed C/C++ 











& Graphics 

MEWEL can run perfectly in DOS graphics 
modes. In fact, Ferrari Electronics, a German 
manufacturer of FAX cards, ships both 
Windows and MEWEL based FAX 


itmaps 


Hindouws 





previewing software with their cards. Our 
“GDI wrapper” interfaces MEWEL with 
several popular graphics libraries, including 
Microsoft, BGI, and GX. 


C++ Users 


MEWEL can be used with Borland’s OWL, 
Inmark Development’s zAPP, and the 
Microsoft AFX class libraries. Or, create your 
own C++ classes for MEWEL. 





» Unparalleled Support 

We maintain a 24 hour BBS which can be 
accessed for demos and tech support. We also 
have vendor support conferences on BIX 
(join ‘magma’ ) and CompuServe (GO 
PCVEND). We have tons of demos, so you 
can try without buying. 


» MEWEL Toolbox 

Add-on package contains sophisticated data- 
entry fields, interface to MS Help Advisor for 
hypertext based help systems, and several 
different control classes. Call for details. 








PROGRAMMER'S WORKBENCH 


—— EEE A TTT 


(continued from page 110) 

The values within the variable persist 
past the parallel region of the program. 
Unless access to the code containing 
the shared variable is restricted, all 
threads can read or write to the vari- 
able. Shared array variables are com- 
mon in loop parallelism. 

Local variables within the thread point 
only to a private copy of the variable. 
The local variable exists in all threads 
created within the parallel region. The 
values within a local variable are not 
defined beyond the parallel region. Lo- 
cal variables are used exclusively with- 
in the parallel region and can be of any 
data type. Loop indexes and temporary 
variables are often local. 

For parallelism to work correctly, the 
code in each thread must not modify 
the values of variables in use by other 
threads. If this happens, the variable be- 
comes dependent on more than one 
thread, its value becomes unpredictable, 
and the results possibly incorrect. An 
example in SIMD parallelization is when 
each iteration processes a unique array 
element, such that the iteration’s exe- 
cution order does not affect the results: 
Iteration i processes shared variable a/i/, 
iteration i+7 does a/i+1/, and so forth. 

Two specific situations lead to de- 
pendencies: modification of shared 
scalar variables during parallel execu- 
tion, and recurrence. (There are other 
situations, but these two are the most 
troublesome.) The shared scalar-vari- 


for (sum=i=@; i<max; i++) 


sum += ali]; 





Example 1: Shared scalar-variable 
sum. 


for Gini. tees dee) 
alit-]rpinpeewra hat; 





Example 2: Typical case of recur- 
rence, where one iteration of a loop 
depends on the previous iteration. 


Code Block 


becomes 


#pragma parallel 


able sum appearing in the sum reduc- 
tion (see Example 1) might have thread 
0 reading one value of sum, while 
thread 1 writes a new value, and the 
like. The value of sum is not systemat- 
ically determined, leading to wrong re- 
sults. Controlling thread access to the 
statement containing the dependence 
will prevent this type of dependence, 
which can occur in both loop and block 
parallel constructs. 

Recurrence happens when one iter- 
ation of a loop depends on a previous 
iteration; see Example 2. There is no 
good way to correct it besides rewrit- 
ing the code or running the loop in se- 
rial. Generally, the 7-7 (or i+ D index- 
ing identifies a recurrence. 

To test for dependence in loop par- 
allelization, index the for loop back- 
wards during serial execution. If the re- 
sult is correct, there are probably no 
dependencies. 


Fundamentals 
You introduce parallelism into a pro- 
gram in two steps. First, define the re- 
gion in the source code in which par- 
allelism will be active and declare the 
parallel behavior of all variables for the 
region. Second, select for loops or 
blocks located within the parallel region 
for parallelization. The first step estab- 
lishes the environment and the second 
speeds up execution. Multiple for loops, 
independent blocks, or combinations 
can be established within a single par- 
allel region, and a program can have as 
many parallel regions as necessary. 
The most time-consuming part of 
the program is the best place to in- 
troduce the parallel region. Speedup 
of this block will generally result in a 
dramatic improvement in overall pro- 
gram performance, even if it is a rel- 
atively small amount of code. Paral- 
lelization of less-sensitive blocks may 
yield little if any performance return. 
Profiling serial execution locates the 
desired “hot spots.” 


#pragma shared (arrayvarl, arrayvar2..., grandtotal) 
#pragma local (index1l, index2, subtotall,...) 
#pragma byvalue(shared_constanti,...) 

{ 


Code Block 


or, the equivalent 


#pragma parallel shared(list) local(list) byvalue(list) 
{ 


Code Block 





Example 3: Format of a typical parallel region. 
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The format of a typical parallel re- 
gion is shown in Example 3. Place the 
parallel-region code into a code block 
preceded by #pragma parallel and its 
modifiers. The main directive and its 
modifiers can be spread out or placed 
on one line. I prefer the spread-out ver- 
sion because it handles long variable 
lists better. 

Declare the behavior of variables with- 
in the parallel region with the modifiers 
to #pragma parallel. The lists consist of 
the variables of any given type, sepa- 


Independent block 
ordering is critical 
for successful 
portability 





rated by commas. The byvalue modifi- 
er is a special shared type used for 
scalars that maintain constant value with- 
in the parallel region, such as a for loop 
bounds limit, yielding a net savings on 
overhead. All values not declared are as- 
sumed to be shared, but fully declaring 
all variables helps clarify intent. 

The parallel region treats code out- 
side the loop or block parallel constructs 
as local to each thread. Local code is 
duplicated in and executed by all 
threads. Local code is useful, but can 
result in incorrect execution. For in- 
stance, the for loop in Example 4 is part 
of the local code resulting in sum = 
50000 four times using four threads, 
which may not be what you intended. 
Also, the value of 7 outside the parallel 
region is not defined. 


The pfor Block 

Within the parallel region, one or more 
for loops ean be set for loop paral- 
lelization. Only those for loops in which 
the exact number of iterations is known 
at compile time can be parallelized. To 
take advantage of loop parallelization, 
while and do while \oops must be rewrit- 
ten into for loops, if possible. 

The general format for parallelizing a 
simple for loop is shown in Example 5. 
The #pragma pfor directive precedes 
the unchanged for loop placed within 
a code block. The pfor directive takes 
the mandatory modifier iterate, which 
identifies the index variable, the start- 
ing value of the index, the maximum 
number of iterations, and the value of 
the increment. Note that iterate syntax 
looks like, but is not the same as the 
for statement syntax. 

The above transformations can be 
ported from the Power Series environ- 
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ment to other serial environments. In a 
serial environment, the compiler ignores 
the two pragmas (typically with warn- 
ing messages), as well as the addition- 
al code block created by the added 
braces, and compiles the original for 
loop. The resulting code executes as ex- 
pected without removal of the paral- 
lelism directives. The compiler on a 
Power Series machine, with multipro- 
cessing (-mp) active, transformed the 
code for parallel execution. 


The Independent Block 


Block parallelization allows for paral- 
lelization of C constructs like linked-list 
manipulation and other operations not 
amenable to loop parallelization. The 
principal weakness is a lack of an at- 
tention-drawing landmark, such as a for 
loop, to provide hints as to the paral- 
lelization approach. 

Independent blocks are created by 
placing the selected block code within 
a code block preceded by #pragma in- 
dependent. Each independent block cre- 
ated executes on a single thread. Any 
number of blocks can be created in a 
single parallel region. If the available 
threads outnumber the blocks, each ex- 
ecutes on its own thread and the re- 
maining threads move on to the next 
parallel construct in the parallel region. 
If blocks outnumber threads, then the 
independent blocks execute as threads 
become available. If a pfor block follows 
a group of independent blocks in a sin- 
gle parallel region, the execution be- 
havior can potentially become complex. 


for (i=@; i<max; itt) 
piel. = Gonetta li) 


becomes 


In Example 6, code blocks 1 and 2 will 
execute simultaneously, while any ad- 
ditional threads, such as would be found 
on a four-processor machine, will idle. 

Independent-block parallelization is 
also portable. A serial compiler ignores 
the pragmas (again with warning mes- 
sages) and the added code blocks cre- 
ated by the braces and processes the 
original two code blocks. The serial ex- 
ecution behavior remains block 1, then 
block 2. Block order is important for 
maintaining portability: If, for instance, 
block 1 adds elements to a linked list, 
and block 2 removes elements off the 
same linked list, parallel execution may 
be correct regardless of the starting or- 
der. However, serial behavior will be 
wrong if block 2 tries to remove ele- 
ments from a linked list not yet filled 
with elements by block 1. 


Critical Blocks 

Shared scalar variables present a prob- 
lem when used within a parallel region. 
Any thread can write to the scalar in 
any order. This behavior can be parttic- 
ularly chaotic for sum-reduction loops. 
For instance, the parallelized loop in Ex- 


/* strictly local code */ 
#pragma parallel 

#pragma local(i, j) 

{ 


for (i=sum=0; i<1000; i++) 
sum += i; 
printf("sum = %d\n", sum); 


Example 4: Local code can result in 
unexpected execution. 


#pragma parallel shared(a,b) local(i) byvalue(max,const) 


#pragma pfor iterate(i=@; max; 1) 


for .(i=0% a <max ?" 1++) 


bli] = const*a[i]: 





Example 5: Parallelizing a simple for loop. 


code block i 
code block 2 


becomes 


#pragma parallel shared(list) local(list) byvalue(list) 
{ 


#pragma independent 


code block i 


#pragma independent 


code block 2 





Example 6: Code blocks 1 and 2 will execute simultaneously. Any additional 


threads will idle. 
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| HEADACHES, 
HASSLES, 
HVPE. 


THOUSANDS OF DEVELOPERS depend on 
Overlay Architect and Overlay Optimizer 
(the Overlay Series) to quickly and 
easily design, build and maintain 
performance-tuned compact, reliable 
overlay structures. Tools that automate 
and expand the use of your favorite 
linker (specify Plink86+, .RTLink, or 
RTLink+), the Overlay Series is the 
painless way to make your complex 
program small, fast and easy to link. 


produces nested overlays and multiple overlay areas 
automatically handles indirect calls (function pointers) 
detects unused code 





automates data overlays 

gives you the power to fine-tune your overlay structure 

runs fast—10 times faster than Plink or .RTLink! 

provides precision control over your linker's treatment of 

individual symbols 

enables Plink or .RTLink to outperform VML and Blinker 
286 Overlay Series supports up to 16,000 code symbols 

(DOS/ 0S/2) $289 

386 Overlay Series supports up to 64,000 symbols/32 

bit performance $339 


CALL: 303-958-lcl0 
FAX: 303-447-1475 


FULL 30 DAY MONEY-BACK GUARANTEE 


We accept VISA & MC 


:) MAT 


ee ORE SO ee 


© 1992, AtLast! Software, Inc. Overlay Architect ond Overlay Optimizer are trademarks of 
AtLast! Software, Inc. All other trademarks are property of their respective owners 
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ample 7 will execute incorrectly. One 
thread could be reading the value of 
sum while others are writing, making 
the value of sum unpredictable. 

Isolate data dependencies by placing 
the statement containing the depen- 
dence within a code block preceded by 
#pragma critical. Only one thread at a 
time has access to the code within the 
critical block; threads that arrive at the 
critical block while it is occupied wait 
until it is free. The code within the crit- 
ical block is ultimately executed by all 
threads. 

Portability for the critical block into 
the serial environment is maintained. 
The serial compiler ignores both #prag- 
ma critical and the additional code 
blocking and sees only the simple /or 
loop conducting a summation. Critical 
blocks make it possible to parallelize 
code afflicted with dependencies. This 
mechanism can also be applied to in- 
dependent blocks, local code, and 
mixed constructs. 

Applications such as a sum reduction 
containing a critical block may perform 
poorly if more than one thread waits 
during execution, as is often the case 
for simple code. The problem and an 
effective cure are presented in Listings 
One and Two (page 124). 


Examples 

The first case is an expanded version of 
the sum-reduction example that demon- 
strates how to both manage a depen- 
dence and use local code to improve 
performance. These are “before” (List- 
ing One) and “after” (Listing Two) ex- 


sum = 0.0; 
#pragma parallel shared (sum, a) local(i) byvalue(max) 
‘ 


amples, both of which fill arrays that are 
then used in a sum reduction. The kth 
loop takes the place of real code and 
lengthens the execution time of each 
thread enough to overcome paralleliza- 
tion overhead. In Listing One, the de- 
pendence is isolated within a critical 
block located within the deepest (jth) 
loop nest. In Listing Two, the reduction 
was rewritten to determine a depen- 
dence-free subtotal for each thread, fol- 
lowed by a dependent grand summa- 
tion in local code isolated within a critical 
block. Table 1 shows the effect on per- 
formance of moving the critical block 
out of the innermost loop and into lo- 
cal code on a four-processor 4D/240. 

Each example was compiled without 
change into serial and parallel versions. 
Both parallel versions exhibit speedups 
over the serial versions, with close but 
not identical results (read on, please). 
Moving the critical block out of the pfor 
block results in a substantially faster par- 
allel version of Listing Two. The high 
CPU utilization relative to an actual 
speedup in the parallel version of List- 
ing One reveals the time threads spent 
in contention for the critical block. There 
is a small discrepancy in the computed 
sum between the parallel and serial ver- 
sions. This is a result of different sum- 
mation sequences, which produce dif- 
ferent round-off error accumulations. 
Round-off error is a problem for paral- 
lel reductions, precision-sensitive cal- 
culations cannot be done in parallel if 
identical results between parallel and 
serial calculations are required—a lim- 
itation to portability. 


#pragma pfor iterate (i=; max; 1) 
gy ge ala ela Bs SA Ba 


ain ta he 





Example 7: Using a shared scalar variable within a parallel region. 


73.8 
147.9 
74.0 
ThE 


1. Example1 (serial) 
2. Example1 (parallel) 
3. Example2 (serial) 
4. Example2 (parallel) 





Table 1: Sum-reduction execution results. 





Consumer 1 
Consumer 2 
Consumer 3 


Distribution* 


List depth* 
Time 

CPU utilization 
Speedup 


*Units are “products.” 


6039797.7653373638 
6039797.7643598625 
6039797 .7653373638 
6039797 .7641029358 


10,000 


Table 2: Execution results for the producer-consumer problem. 
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Listing Three, page 124, shows a more 
elaborate use of independent blocks to 
process a linked list in the producer- 
consumer problem. In this example, a 
single producer fills a pipe (FIFO queue) 
with “products” consumed by multiple 
“consumers” running in their own in- 
dependent blocks. The product is a da- 
ta structure. In this case, it holds an int, 
but it could hold more complicated da- 
ta. Critical blocks limit access to the root 
pointer of the linked-list to one pro- 
ducer/consumer(s) at a time. The pro- 
ducer and consumers all begin at the 
start of the simulation. The simulation 
ends when the consumers each receive 
a stop token sent with the product. The 
producer loads three tokens into the 
queue before quitting. Use of stop to- 
kens instead of a null pointer to termi- 
nate the simulation allows the pipe to 
occasionally run dry when, for instance, 
the consumers are faster than the pro- 
ducer. 

The execution results of Listing Three 
are shown in Table 2. The execution 
threads were varied while retaining the 
four independent blocks. Execution re- 
sults of the serial version are identical 
to those of the single-thread version. 
Note that even the two-thread case (one 
active consumer, the other two wait) 
achieves substantial speedups relative 
to the serial version, although perfor- 
mance substantially improves with more 
active consumers. 

Independent block ordering is criti- 
cal for successful portability. The pro- 
ducer (the first independent block) com- 
pletely fills the queue, then quits. The 
first consumer (the second independent 
block) then consumes all the products 
in the queue, then quits. The remain- 
ing two consumers are left only with 
their stop tokens. 

Listing Three is an example of hetero- 
parallelism, the most complicated form 
of parallelism, portable into the serial 
environment. Program structure remains 
clear, and parallel performance gains 
are substantial. Nonobviousness will be 
the main barrier to implementing block 
parallelization. 

All three examples compile in the 
CONVEX C200 and 386-PC environ- 
ments. All executed correctly as serial 
programs with the exception of Listings 
One and Two on the PC, which lacked 
38 Mbytes of available memory. 


Additional Reading 
Bauer, B.E. Practical Parallel Program- 
ming. San Diego, CA: Academic Press, 
1992. 
DDJ 
(Listings begin on page 124.) 
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BIG-NUMBER ARITHMETIC 


Listing One (Text begins on page 40.) 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data 

Security, Inc. All rights reserved. +*/ 
/* PROTOTYPES should be set to one if and only if the compiler supports 
function argument prototyping. The following makes PROTOTYPES default to @ 
if it has not already been defined with C compiler flags. */ 


#ifndef PROTOTYPES 
#define PROTOTYPES @ 
#endif 


/* UINT2 defines a two byte word */ 
typedef unsigned short int UINT2; 


/* UINT4 defines a four byte word */ 
typedef unsigned long int UINT4; 


/* PROTO_LIST is defined depending on how PROTOTYPES is defined above. If using 
PROTOTYPES, PROTO_LIST returns the list, otherwise it returns empty list. */ 

#if PROTOTYPES 

#define PROTO_LIST(list) list 

#else 

#define PROTO_LIST(list) () 

#endif 


/* RSA key lengths. */ 
#define MAX_RSA_MODULUS_BITS 1024 
#define MAX_RSA_MODULUS_LEN ((MAX_RSA_MODULUS_BITS + 7) / 8) 


typedef UINT4 NN_DIGIT; 
typedef UINT2 NN_HALF_DIGIT; 


/* Length of digit in bits */ 
#define NN_DIGIT_BITS 32 
#define NN_HALF_DIGIT_BITS 16 
/* Length of digit in bytes */ 
#define NN_DIGIT_LEN (NN_DIGIT_BITS / 8) 
/* Maximum length in digits */ 
#define MAX_NN_DIGITS \ 
( (MAX_RSA_MODULUS_LEN + NN_DIGIT_LEN - 1) / NN_DIGIT_LEN + 1) 
/* Maximum digits */ 
#define MAX_NN_DIGIT Oxffffffff 
#define MAX_NN_HALF_DIGIT @xffff 


/* Macros. */ 
#define LOW_HALF(x) (NN_HALF_DIGIT) ((x) & MAX_NN_HALF_DIGIT) 
#define HIGH_HALF(x) \ 

(NN_HALF_DIGIT) (((x) >> NN_HALF_DIGIT_BITS) & MAX_NN_HALF_DIGIT) 
#define TO_HIGH_HALF(x) (((NN_DIGIT) (x)) << NN_HALF_DIGIT_BITS) 
#define DIGIT_MSB(x) (unsigned int) (((x) >> (NN_DIGIT_BITS - 1)) & 1) 
#define DIGIT_2MSB(x) \ 

(unsigned int) (((x) >> (NN_DIGIT_BITS - 2)) & 3) 


/* NOTE: A bug in the MPW 3.2 C compiler causes an incorrect sign extension in 
the routine NN_DigitDiv. To overcome this bug, change the definition of the 
macro HIGH_HALF to: 

#define HIGH_HALF(x) (((x) >> NN_HALF_DIGIT_BITS) & MAX_NN_HALF_DIGIT) */ 


void NN_Assign PROTO_LIST ((NN_DIGIT *, NN_DIGIT *, unsigned int)); 
void NN_AssignZero PROTO_LIST ((NN_DIGIT *, unsigned int)); 


NN_DIGIT NN_Add PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, NN_DIGIT *, 
NN_DIGIT NN_Sub PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, NN_DIGIT *, unsigned int)); 
void NN_Mult PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, NN_DIGIT *, unsigned int)); 
int NN_Cmp PROTO_LIST ((NN_DIGIT *, NN_DIGIT *, unsigned int)); 
unsigned int NN_Bits PROTO_LIST ((NN_DIGIT *, unsigned int)); 
unsigned int NN_Digits PROTO_LIST ((NN_DIGIT *, unsigned int)); 


unsigned int)); 


NN_DIGIT NN_LShift PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, unsigned int, unsigned int)); 
NN_DIGIT NN_RShift PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, unsigned int, unsigned int)); 
void NN_Div PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, 

unsigned int)); 

NN_DIGIT NN_AddDigitMult PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, NN_DIGIT, NN_DIGIT *, 
NN_DIGIT NN_SubDigitMult PROTO_LIST 

((NN_DIGIT *, NN_DIGIT *, NN_DIGIT, NN_DIGIT *, unsigned int)); 
unsigned int NN_DigitBits PROTO_LIST ((NN_DIGIT)); 
void NN_DigitMult PROTO_LIST ((NN_DIGIT [2], NN_DIGIT, NN_DIGIT)); 
void NN_DigitDiv PROTO_LIST ((NN_DIGIT *, NN_DIGIT [2], NN_DIGIT)); 


End Listing One 


NN_DIGIT *, unsigned int, NN_DIGIT *, 


unsigned int)); 


Listing Two 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data 
Security, Inc. All rights reserved. */ 

/* Computes a=bt+c. Returns carry. Lengths: aldigits], b[digits], c[digits]. */ 
NN_DIGIT NN_Add (a, b, c, digits) 
NN_DIGIT *a, *b, *c; ° 
unsigned int digits; 
{ 

NN_DIGIT ai, carry; 

unsigned int i; 


®; i< digits; it+) { 


if ((ai = b[i] + carry) < carry) 
ai = c[il; 
else if ((ai += c[i]l) < c[i]) 
carry = 1; 
else 
carry = @; 
a[i] = ai; 
) 
return (carry) ; ae 
End Listing Two 
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Listing Three 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data 
Security, Inc. All rights reserved. */ 

/* Computes a=b-c. Returns borrow. Lengths: 
NN_DIGIT NN_Sub (a, b, c, digits) 
NN_DIGIT *a, *b, *c; 
unsigned int digits; 
{ 

NN_DIGIT ai, borrow; 

unsigned int i; 


aldigits], b[digits], c[digits]. */ 


borrow = @; 


for (i = @; i < digits; i++) f 
if ((ai = bli] - borrow) > (MAX_NN_DIGIT - borrow) ) 
= MAX_NN_DIGIT - cli]; 
else if ((ai -= c[i]) > (MAX_NN_DIGIT - c[i])) 
borrow = 1; 


else 
borrow = @; 
afi] = ai; 


} 


return (borrow); 


End Listing Three 
Listing Four 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data 
Security, Inc. All rights reserved. */ 
/* Computes a=b*c, where b and c are digits. 
void NN_DigitMult (a, b, c) 
NN_DIGIT a[2], b, c; 
NN_DIGIT t, u; 


Lengths: a[2]. */ 


NN_HALF_DIGIT bHigh, bLow, cHigh, cLow; 
bHigh = HIGH_HALF (b); 

bLow = LOW_HALF (b); 

cHigh = HIGH_HALF (c); 


cLow = LOW_HALF (c); 


[8] 


® (NN_DIGIT) bLow * (NN_DIGIT) cLow; 
1 


a = 
t (NN_DIGIT)bLow * (NN_DIGIT)cHigh; 
u = (NN_DIGIT)bHigh * (NN_DIGIT)cLow; 
a[1] = (NN_DIGIT)bHigh * (NN_DIGIT) cHigh; 
if ((t += u) < u) 

Pe += TO_HIGH_HALF (1); 

= TO_HIGH_HALF (t); 


if ((a[@] += u) < u) 
alii; 
a[1] += HIGH_HALF (t); 


End Listing Four 
Listing Five 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data 
Security, Inc. All rights reserved. */ 
/* Sets a=b/c, where a and c are digits. 
and HIGH_HALF (c) > @. For efficiency, 
void NN_DigitDiv (a, b, c) 
NN_DIGIT *a, b[2], ¢; 
f 
NN_DIGIT t[2], u, v; 
NN_HALF_DIGIT aHigh, aLow, 


Lengths: b[2]. Assumes b[1] < c 
c should be normalized. */ 


cHigh, cLow; 


cHigh = HIGH_HALF (c); 
cLow = LOW_HALF (c); 

t (0) = b[O]; 

t{1] = b[1]; 


/* Underestimate high half of quotient and subtract product 
of estimate and divisor from dividend. */ 
if (cHigh == MAX_NN_HALF_DIGIT) 
aHigh = HIGH_HALF (t([1]); 
else 
aHigh = (NN_HALF_DIGIT) (t[1] / (cHigh + 1)); 
= (NN_DIGIT)aHigh * (NN_DIGIT)cLow; 
= (NN_ DIGIT) aHigh * (NN_DIGIT) cHigh; 
( 0 ] - = TO_HIGH_HALF (u)) > (MAX_NN_DIGIT - TO_HIGH_HALF (u))) 
1 


ri 
a: 
t [1] HIGH HALF (u); 
f= 


no 
s a 


/* Correct estimate. */ 
while ((t[1] > cHigh) || 
((t[{1] == cHigh) && (t[@] >= TO_HIGH_HALF (cLow)))) { 


if ((t[@] -= TO_HIGH_HALF (cLow)) 
> MAX_NN_DIGIT - TO_HIGH_HALF (cLow) ) 
We 
t[1] -= cHigh; 
aHight+ ; 


} 
/* Underestimate low half of quotient and subtract product of 
estimate and divisor from what remains of dividend. */ 


if (cHigh == MAX_NN_HALF_DIGIT) 
aLow = LOW_HALF (t[1]); 

else 
aLow = 


(NN_HALF_DIGIT) 
((NN_DIGIT) (TO_HIGH_HALF (t[1]) + HIGH_HALF (t[@])) 
/ (cHigh + 1)); 


(continued on page 118) 
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BIG-NUMBER ARITHMETIC 


Listing Five (Listing continued, text begins on page 40.) 


(NN_DIGIT) aLow * (NN_DIGIT)cLow; 
(NN_ oo * (NN_DIGIT)cHigh; 
if ((t[@] -= u) > (MAX_NN_DIGIT - u)) 
t{1)==; 
if ((t[@] -= TO_HIGH_HALF (v)) > (MAX_NN_DIGIT - TO_HIGH_HALF (v))) 
tlal-=; 
t[1] -= HIGH_HALF (v); 


u 


/* Correct estimate. */ 
while ((t[1] > @) |; ((t[1] == ®) && t[0] >=c)) { 
if ((t[@) -= c) > (MAX_NN_DIGIT - c)) 
tit) ==3 


aLowtt; 


*a = TO_HIGH_HALF (aHigh) + aLow; 


End Listing Five 


Listing Six 


/* Copyright (C) 1991-2 RSA Laboratories, 
Security, Inc. All rights reserved. */ 
/* Computes a=b+c*d, where c is a digit. Returns carry. 
Lengths: aldigits], b[digits], d[digits]. */ 
NN_DIGIT NN_AddDigitMult (a, b, c, d, digits) 
NN_DIGIT *a, *b, c, *d; 
unsigned int digits; 
{ 
NN_DIGIT carry, t[2]; 
unsigned int i; 


a division of RSA Data 


carry = 0; 
for (i = @; i < digits; i++) { 
NN_DigitMult (t, c, d[il]); 
if ((ali] = b[i] + carry) < carry) 
carry = 1; 
else 
carry = @; 
if ((ali] += 
carrytt; 
carry += t[1]; 
} 


return (carry); 


t{@]) < t[@]}) 


End Listing Six 
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Listing Seven 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data 


Security, Inc. All rights reserved. 

/* Computes a=b*c. Lengths: a[2*digits], b[digits], c[digits]. 

Assumes digits < MAX_NN_DIGITS. */ 
void NN_Mult (a, b, c, digits) 
NN_DIGIT *a, *b, *c; 
unsigned int digits; 
if 

NN_DIGIT t[2*MAX_NN_DIGITS] ; 

unsigned int bDigits, cDigits, i; 

NN_AssignZero (t, 2 * digits); 
bDigits = NN_Digits (b, digits); 
cDigits = NN_Digits (c, digits); 


for (i = @; i < bDigits; i++) 
t[itcDigits] += NN_AddDigitMult (&t[i], 
NN_Assign (a, t, 2 * digits); 


, blil, c, cDigits); 


End Listing Seven 


Listing Eight 


/* Copyright (C) 1991-2 RSA Laboratories, 
Security, Inc. All rights reserved. */ 
/* Computes a=b-c*d, where c is a digit. Returns borrow. 
Lengths: al[digits], b[digits], d[digits]. */ 
NN_DIGIT NN_SubDigitMult (a, b, c, d, digits) 
NN_DIGIT *a, *b, c, *d; 
unsigned int digits; 
{ 
NN_DIGIT borrow, t[2]; 
unsigned int i; 


a division of RSA Data 


borrow = @; 
for (4.=@: 4:< digite; i7+) { 
NN_DigitMult (t, c, d[i]); 
if ((a[i] = bli] - borrow) > (MAX_NN_DIGIT - borrow) ) 
borrow = 1; 
else 
borrow = Q; 
if ((a[i] -= t[@]) > (MAX_NN_DIGIT - t[@])) 
borrowtt; 
borrow += t[1]; 
} 


return (borrow); 


End Listing Eight 


Listing Nine 


/* Copyright (C) 1991-2 RSA Laboratories, 
Security, Inc. All rights reserved. */ 
/* Computes a=c div d and b=c mod d. Lengths: al[cDigits], b[dDigits], 
c[cDigits], d[dDigits]. Assumes d > ®, cDigits < 2 * MAX_NN_DIGITS, 
dDigits < MAX_NN_DIGITS. */ 
void NN_Div (a, b, c, cDigits, 
NN_DIGIT *a, *b, *c, *d; 
unsigned int cDigits, dDigits; 


a division of RSA Data 


d, dDigits) 


NN_DIGIT ai, dd [MAX_NN_DIGITS], t; 
int i; 
unsigned int ddDigits, 


ec [2*MAX_NN_DIGITS+1], 
shift; 


ddDigits = NN_Digits (d, dDigits); 
if (ddDigits == @) 
return; 


/* Normalize operands. */ 
shift = NN_DIGIT_BITS - NN_DigitBits (d[ddDigits-1]); 
NN_AssignZero (cc, ddDigits) ; 


ec[cDigits] = NN_LShift (cc, c, shift, cDigits); 
NN_LShift (dd, d, shift, ddDigits); 

t = dd[ddDigits-1]; 

NN_AssignZero (a, cDigits) ; 

for (i = cDigits-ddDigits; i >= @; i--) { 


/* Underestimate quotient digit and subtract. */ 
if (t == MAX_NN_DIGIT) 
ai = cclitddDigits] ; 
else 
NN_DigitDiv (&ai, 
ec[itddDigits] -= 
NN_SubDigitMult (&cc[il], 
/* Correct estimate. */ 
while (cc[itddDigits] |; (NN_Cmp (&cc[i], 
aitt+; 
ec[itddDigits] -= NN_Sub (&cc[i], 


&ee [itddDigits-1], t + 1); 


&ec [i], ai, dd, ddDigits); 

dd, ddDigits) >= 0)) { 

&ec[i], dd, ddDigits); 
afi] = ai; 

} 

/* Restore result. */ 


NN_AssignZero (b, dDigits); 
NN_RShift (b, cc, shift, ddDigits); 


End Listing Nine 
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Listing Ten 


NN_DIGIT NN_LShift (a, b, c, 


NN_DIGIT *a, *b; 


/* Copyright (C) 1991-2 RSA Laboratories, a division of RSA Data unsigned int c, digits; 
Security, Inc. All rights reserved. */ { 


/* Assigns a=@. Lengths: al[digits], 


void NN_Assign (a, b, digits) 
NN_DIGIT *a, *b; 
unsigned int digits; 


unsigned int i; 
for {3 = 8:2 < digite: i++) 
a[i] = bli]; 
} 


b[digits]. */ NN_DIGIT bi, carry; 
unsigned int i, t; 
if (c >= NN_DIGIT_BITS) 
return (Q); 
t = NN_DIGIT_BITS - ¢; 
carry = @; 
for (i = 
bi = blil; 
a{i] = 


carry =c ? (bi >> t) 


/* Assigns a = ®. Lengths: a[digits]. */ 


void NN_AssignZero (a, digits) 
NN_DIGIT *a; 

unsigned int digits; 

! 


unsigned int i; 


return (carry); 


} 


digits) 


O: i < digits; itt) { 


(bi << ¢) | Carry; 
: @; 


/* Computes a = c div 2%c (i.e., shifts right c bits), returning carry. 
Lengths: a[digits], b[digits]. Requires: c < NN_DIGIT_BITS. */ 


for (i = 0; i < digits; i++) NN_DIGIT NN_RShift (a, b, c, digits) 
ali] = 0; NN_DIGIT *a, *b; 
} unsigned int c, digits; 
/* Returns sign of a-b. Lengths: aldigits], b[digits]. */ NN_DIGIT bi, carry; 
int NN_Cmp (a, b, digits) int i; 
NN_DIGIT *a, *b; unsigned int t; 
unsigned int digits; if (c >= NN_DIGIT_BITS) 
return (0); 
int i; t = NN_DIGIT_BITS - c; 
for (i = digits - 1; i >= @; i--) { carry = @; 
if (a[i] > b[i]) for (i = digits - 1; i >= @; i--) { 
return (1); bi = bli]; 
if (a[i] < blil]) a[i] = (bi >> c) | carry; 
return (-1); carry =c ? (bi << t) : @; 
} 
return (@); return (carry); 
} } 
/* Returns the significant length of a in digits. Lengths: a[digits]. */ /* Returns the significant length of a in bits, where a is a digit. */ 
unsigned int NN_Digits (a, digits) unsigned int NN_DigitBits (a) 
NN_DIGIT *a; NN_DIGIT a; 
unsigned int digits; { 


int i; 


unsigned int i; 


for (i = 0; i < NN_DIGIT_BITS; it+, a >>= 1) 


for (i = digits - 1; i >= @; i--) if (a == 0) 
if (a[il) break; 
break; return (i); 


return (i + 1); 


} 


/* Computes a = b * 2%c (i.e., shifts left c bits), returning carry. 


Lengths: aldigits], b[digits]. 


ImageSett 


Requires c < NN_DIGIT_BITS. */ 


™ “ AMIST - ChangeL. ogBrowser {not checked oul] 
dit Smalital Bee Class ethods Variables _— 
pplicati 1age Upd Fe 
ChangeLogBro H 
ConfiguratianMenage i i 
Graphicvie i ; 
STMods _ | i 


Fin ST, developed by 


Systems Division of Coopers & 


Application subclass 


the SoftPert instance VariacieName : ? jet lates With Lock 


poolDictionaries: : gtalt* x i Without Lock 


Lybrand, enables the developer to 

manage large, complex, object-orient- 

ed applications. The AM/ST Appli- 

cation Browser provides multiple 

views of a developer's application. = 
AM/ST defines Smalltalk /V applica- 


tions as logical groupings 


of classes and 


methods which can be managed in source Execution tuning & rofiling tools; 
files independent of the Smalltalk / V Cross reference nents 

image. An application can be locked and archy; 

modified by one developer, enabling other Autsate documen tion; 
developers to browse the source code. The © Source control; 

source code control system manages multi- sg Static analysis joo 


ple revisions easily. 


Coopers 





. Dyn. lamic analysis ee 
Your choice of platforms: 


C&L &Ly rand | DOS, Windows, OS/2/PM & Mac. 


All trademarks are the property of their respective owners. ImageSoft, Inc., 2 Haven Avenue, Port Washington, NY 11050 516/767-2233; Fax 516/767-9067; UUCP address: mcdhup!image! info 
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End Listings 


The original and still premier 


application manager for 
Smalltalk/V.™ 


ChangeBrowser. As an additional 
tool available for Smalltalk /V PM 
and Smalltalk /V Windows, Change- 
Browser supports browsing of the 
Smalltalk /V change log file or any 
file in Smalltalk / V chunk format. 


The addition of AM/ST to the 
ImageSoft Family of software develop- 
ment tools enhances and solidifies 
ImageSoft’s position as — 

“The World’s Leading Publisher 
of Object-Oriented Software Gots 
Development Tools.” WINDOWS 


1-800/245- 8840 


The World’s Leading Publisher of Development Tools 
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ASM 10 ( 
A fast and powerful kernel, 


; Listing One (7ex begins ge 72.) 
unmatched stream device /O, isting One (Zext begins on page 


Cc 


1 / * Here is a method of determining how the compiler expects parameters 


ray 


{ 


1 y ' 1 2 tob d bet C and bl tines. F le, 
unique multitasking monitor > you need Go base 4 anteser: Go wi Geacably -oatine,. where Shey will 
4 be manipulated, and an integer value returned. The trick is to write 
4 5 a C routine that simulates this. Th ile t bl d h 
and profiler, PC prototyping, 6 the compiler treats the nether a bee We ace “ee 
7 assembly code as a template and add your code to it). * 
8 int h,i,j,k; 
portable across 15 processor s nain( 


11 h = subroutine(i,j,k); 
ldd _k ; push k on stack first 


families, and fantasic support! 


idd _j ; push j on stack next 


TM pshd 
te- ldd _i ; leave i in register 
jbsr _subroutine 


Multitasking Operating : -h ; result in register 


; 12 3} 
S { rts 
Ys em ; 13 dint subroutine(int.a, int b, int c) 
; 14 { 
_subroutine: 
bf pshx ; save X 
18 your path to pshd ; save i on stack 
tsx ; transfer SP to X 
° ] .set OFST=0 
real-time success! | 15 ceturn(atbte); 
ldd OFST+0,x ; code to copy i back to D 
addd OFST+6,x ; add j 
Byte-BOS is available as source code and can be used ddd ORD ITOye. 5 Ged Ky resale tert tn) 
: . pulx ; restore D into X from stack 
on all PCs at your site. Executable versions can be pulx ; restore X 
distributed with no royalties and our prices are great. 3 


Call today to order, or request a detailed information End Listing One 


packet. With the time and money you save, you’ll be listing Two | a a 
glad you left the multitasking to Byte-BOS. extern void reset(); / : symbol defined in startup routine */ 


extern void default (); * generic routine generates a return from interrupt 








*/ 
Byte-BOS™ \|ntegrated Systems 6 9 8836 void ( * const vec_tab[])() = { 
PO Box 3067 Del Mar CA 92014 1 799 aes Looe ae 
default / * SPI ¥/ 
default, / * Pulse accumulator input edge */ 
CIRCLE NO. 731 ON READER SERVICE CARD default, / * Pulse accumulator overflow */ 
default, / * Timer overflow */ 
default, / * Timer output compare 5 */ 
s s s 2 default, / * Timer output compare 4 */ 
implify Real-Time Programs with default / © Timer output compare 3. */ 
default, / * Timer output compare 2 */ 
default, / * Timer output compare 1 */ 
TM default, / * Timer input capture 3 */ 
default, / * Timer input capture 2 */ 
default, / * Timer input capture 1 */ 
‘ ; default, / * Real time interrupt */ 
Concurrent Programming Toolkit default, / * TRQ_ */ 
default, / * XIRQ */ 
* . default, / * SWI */ 
Vi rsion oy | now av. il | default, / * illegal op-code */ 
‘ ann . default, / * COP watchdog timer fail */ 
Interwork is a “C” program library that lets you write your programs as dae atle. pf €5C0P quitter clock Sai. #/ 
a set of cooperating, concurrent tasks. This greatly simplifies program- penet., / * RESET */ 
ming for real-time applications, simulations, and other parallel pro- }; 
grams. Interwork gives you much more control and efficiency than do . End Listing Two 
heavyweight, multi-tasking operating systems. — g 
FEATURES Listing Three 
Unlimited number of tasks with very fast context switching using a / * Fai recpetec ore eb cays ig ga Slap zh haa Weave haan 
tional, fully preemptive, time-sliced scheduling. Tasks have full ee ee a ee 
op ’ yp p ’ IWS : d g. 3 because that's how putchar is defined by ANSI. The (char) cast is done 
control over preemption and their time-slice sizes. ; 4 to avoid "illegal assignment" warnings from strict compilers. */ 
Full inter-task communication using shared memory. Includes : i oeraie ae aie : are : . os oars — 7 
H H 4 etine *(cnar * x * ata Kegister * 
semaphores, queues, event variables, mailboxes. — 7 #define TDRE ®x8@ / * Transmit Data Register Empty bit */ 
Enhanced interrupt and trap handling facilities. Write interrupt 8 int putchar(int c) 
handlers as C procedures and block tasks on interrupts. ; 9 
Enhanced facilities for managing task stacks. sel 
Debugging, tracing and exception handling facilities. pshd 
Expanded user’s guide with examples and complete reference ee. ener 
manual with on-line documentation. ANSI compliant header files. Li: : Tine Tk, ekeee S 
“Interwork has been running flawlessly for over two years.” ; 10 while (!(SCSR & TDRE)) 
— Glenn Catarious, Quadis, Inc. ldab -102eH 
Interwork is available for the following systems: — oe 
eq 
IBM PC, XT, AT PC-DOS 2.0 or later > 11 / * loop until ready to transmit */ 
Intel 386 UNIX V.3/386 ; 12 SCDR = (char) c; 
DEC VAX, Sun UNIX BSD, Sun OS a oi 
PC-DOS version is compatible with Borland and Zortech C/C** compilers (large model) and Microsoft, ; 13 return(c); 
and Lattice C compilers (large and small models). Please specify hardware and operating system when , ldd OFST+0.x , 
ordering. Shipping and handling free within continental U.S.; COD orders add $5; international orders, : 
add $25; Canada, add $12. Send check or money order to: dere 
. pulx 
Block Island Technologies ae 
Innovative Computer Software ie. eee 


15455 NW Greenbrier Parkway, Suite 210, Beaverton, Oregon 97006 -end ; 
Tel. (503) 690-7181 FAX (503) 645-7732 End Listings 


Trademarks: Interwork, Block Island Technologies; UNIX, Unix System Laboratories; VAX, Digital 
Equipment Corporation; Sun, Sun Microsystems, Inc. 
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Van Gogh’s talent 
went unrecognized 


for a lifetime. 


Luckily Motorola 
didn’t wait that long 
to recognize ours. 


Here at Introl, we’ve been design- 
ing and creating superior software 
products for over a decade. During 
that time, we’ve seen software ven- 
dors come and go. But we have 
remained dedicated to providing 
our customers with the most up-to- 
date software at very reasonable 


prices. 


Haven’t heard of us? Motorola has. 
It didn’t take them long to recog- 
nize the consistency and quality 
that have been a part of Introl’s phi- 
losophy for years. Our intimate size 
allows us to give our customers the 
quality technical support that they 
need—on a first-name basis. We 
can give you the attention that you 
need to finish an innovative project 


on time. And at Introl, you won't 


be talking to a “Reproduction” but 
rather with the original technical 
team that designed and developed 


our software. 


Our Introl-C compilers produce 
quality code in record time. They 
also offer flexibility. Introl 1s the 
only software vendor that has pro- 
duced a coherent palette of C com- 
pilers which support Motorola’s 
68HC11, 68HC16, and 68332 
microprocessors. Our C compilers 
and assemblers are also available 
for other popular microprocessors: 
the 68000 family (including the 
68020 and 68030), the 6801, and 
the 6809. 






Introl also produces a Source Level 
Debugger which works with a vari- 
ety of emulators and also has the 
ability to work with the intrinsic 
background debug mode of the 
68HC16 and 68332 microproces- 


SOTS. 


Ask Motorola. They’ve been using 
Introl software for years. They 
depend on us for quality perfor- 
mance, versatile development tools 


and expert technical service. 


Don’t let a masterpiece slip through 


your fingers. Call Intro! today. 


Telephone: 414-327-7171 
Fax: 414-327-7734 





9220 W. Howard Ave., Milwaukee, WI 53228 
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NETWORKING 


Listing One (7ext begins on page 80.) Listing Two 


From file:/sys/netinet/in_pcb.c: 
From file:/sys/netinet/tcp_input.c: 
/* Find a internet protocol control block associated with a given session. 
* Session is determined as a match between foriegn (faddr, fport) and local 


* (laddr, lport) source/destionation identifiers. Allows wildcard matches /* * Locate peb for segment. * */ 
* when some but not all of the parameters are known. */ findpcb: 
struct inpcb * inp = tcp_last_inpcb; 
in_pcblookup(head, faddr, fport, laddr, lport, flags) /* first, see if this segment looks familiar */ 
struct inpcb *head; if (inp->inp_lport != ti->ti_dport || 
struct in_addr faddr, laddr; inp->inp_fport != ti->ti_sport {| 
u_short fport, lport; inp->inp_faddr.s_addr != ti->ti_src.s_addr {| 
int. flags: inp->inp_laddr.s_addr != ti->ti_dst.s_addr) { 
{ inp = in_pcblookup(&tcb, ti->ti_sre, ti->ti_sport, 
register struct inpcb *inp, *match = Q; ti->ti_dst, ti->ti_dport, INPLOOKUP_WILDCARD) ; 
int matchwild = 3, wildcard; if (inp) 
tcp_last_inpcb = inp; 
for (inp = head->inp_next; inp != head; inp = inp->inp_next) { ++tcppcbcachemiss; 
if (inp->inp_lport != lport) } 
continue; 
wildcard = Q; * ae 
if (inp->inp_laddr.s_addr != INADDR_ANY) { ou End Listing Two 
if (laddr.s_addr == INADDR_ANY) Listing Three 
wildcard++; 
else if (inp->inp_laddr.s_addr != laddr.s_addr) 
continue; From file:/sys/netinet/tcp_input.c: 
} else { 
if (laddr.s_addr != INADDR_ANY) ate 
wildcardtt; /* Header prediction: check for the two common cases of a unidirectional 
} * data xfer. If the packet has no control flags, is in-sequence, the window 
if (inp->inp_faddr.s_addr != INADDR_ANY) { * didn't change and we're not retransmitting, it's a candidate. If the length 
if (faddr.s_addr == INADDR_ANY) * is zero and the ack moved forward, we're the sender side of the xfer. Just 
wildcard+t+; * free the data acked & wake any higher level process that was blocked 
else if (inp->inp_faddr.s_addr != faddr.s_addr j; * waiting for space. If length is non-zero and the ack didn't move, we're the 
inp->inp_fport != fport) * receiver side. If we get packets in-order (the reassembly queue is empty), 
continue; * add data to the socket buffer and note that we need a delayed ack. */ 
} else { if (tp->t_state == TCPS_ESTABLISHED && 
if (faddr.s_addr != INADDR_ANY) (tiflags & (TH_SYN|TH_FIN|TH_RST|TH_URG;TH_ACK)) == TH_ACK && 
wildcardt+; ti->ti_seq == tp->rcv_nxt && 
} ti->ti_win && ti->ti_win == tp->snd_wnd && 
if (wildcard && (flags & INPLOOKUP_WILDCARD) == @) tp->snd_nxt == tp->snd_max) { 
continue; if (ti->ti_len == 0) { 
if (wildcard < matchwild) { if (SEQ_GT(ti->ti_ack, tp->snd_una) && 
match = inp; SEQ_LEQ(ti->ti_ack, tp->snd_max) && 
matchwild = wildcard; tp->snd_cwnd >= tp->snd_wnd) { 
if (matchwild == @) /* this is a pure ack for outstanding data. */ 
break; ++tcppredack; 
} /* need to update round trip timers */ 
} if (tp->t_rtt && SEQ_GT(ti->ti_ack,tp->t_rtseq) ) 
return (match) ; tep_xmit_timer (tp) ; 
} /* free sent data that was acknowledged */ 


acked = ti->ti_ack - tp->snd_una; 
tepstat.tcps_revackpacktt; 
tepstat.teps_revackbyte += acked; 
sbdrop(&so->so_snd, acked) ; 


End Listing One 


using current (possibly backed-off) value. 
» SOUFCE 
Microkernel : 
output or persist. */ 


tp->snd_una = ti->ti_ack; 
* 
* 
* 
* If process is waiting for space, 
* 
* 
* 
if (tp->snd_una == tp->snd_max) 
tp->t_timer [TCPT_REXMT] = @; 
else if (tp->t_timer[TCPT_PERSIST] == 0) 
tp->t_timer [TCPT_REXMT] = tp->t_rxtcur; 
if (so->so_snd.sb_flags & SB_NOTIFY) 
sowwakeup (so) ; 


/* free this packet */ 
wakeup/selwakeup/signal. If data are ready 
if (so->so_snd.sb_cc) 
os for 1386 and sane ProGS@ssors 





m_freem(m) ; 
/* If all outstanding data are acked, stop 
a com Dp | e t eC retransmit timer, otherwise restart timer 
to send, let tcp_output decide between more 
(void) tep_output (tp); 


return; 

} 

} else 

/* This is a pure, in-sequence data packet with nothing on the 

based on * reassembly queue; we have enough buffer space to take it. */ 
if (ti->ti_ack == tp->snd_una && 
; tp->seg_next == (struct tcpiphdr *)tp && 
OSF/Motif™ GUI Rete <= sbspace(&so->so_rcv)) { 


/* update received state and statistics */ 
tp->rev_nxt += ti->ti_len; 
++tcppreddat; 
tepstat.tcps_rcevpackt++; 
te fa Cc € tepstat.tceps_revbyte += ti->ti_len; 
/* Deliver data by dropping TCP and IP headers, then 
* add data to application's socket buffer, and wakeup 
* application if necessary. */ 
m->m_data += sizeof(struct tcpiphdr) ; 
m->m_len -= sizeof(struct tcpiphdr) ; 
sbappend(&so->so_rev, m); 
sorwakeup (so) ; 
/* request that a acknowledgement be sent with next 
* data packet outbound -- we delay in hopes of 
* "piggy backing" on top of a data packet. 
tp->t_flags |= TF_DELACK; 
return; 


in g 7 End Listings 


MT XINU- 2560 Ninth Street, Berkeley, CA 94710 A 
510 644-0146 A Fax 510 644-2680 A mach@mtxinu.com 
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ANNOUNCING 
PASCAL. 


A High Performance Option 
For Turbo Pascal Users 


We agree that Borland Turbo Pascal® is a great street 
machine. But when your applications demand the 
ultimate in performance, the Stony Brook Pascal t 
optimizing compiler has the extra power you need — and 
an unbeatable list of standard features. 


DON’T CHANGE A LINE OF YOUR CODE 

It couldn’t be easier to trade up to Pascal + . We’re 100% 
language-compatible with Turbo Pascal V6. You don’t 
change your Turbo Pascal programs at all. Just compile 
with Pascal+ and get: 


Execution speeds up to 100% faster 
Code size up to 30% smaller 


You get fully optimized code with NO difference in your 
program's operation! 


ELIMINATE ROADBLOCKS 
Pascalt comes fully equipped to handle ANY 


programming problem you encounter. Unlike Turbo 
Pascal, we produce standard Microsoft® objects, support 


all memory models, and give you complete control over 
procedure-calling and parameter passing conventions. 


INSTANTLY LINK TO OTHER LANGUAGES 

With Pascal t+ it’s easy to interface with code written in 
any other language. This means, for instance, that you 
can have immediate access to millions of lines of 
commercial library code written by and for C 
programmers. 


BEYOND 
TURBO 


, AND WE DO WINDOWS! 
Stony Brook Pascal + comes with 
full support for Microsoft Windows 3.0. 

We provide the interface units, and you use the 
windows API exactly as you would with Microsoft C. 


In fact, anything you can do with 
Microsoft® C or TurboC®, you can do 
with Stony Brook Pascal ¥ ! 


So get on the track with Stony Brook Pascal+. You'll 
qualify for races you never could enter before. 


CALL NOW OR WRITE FOR INFORMATION 
800/624-7487 © 805/496-7429 Fax © 805/496-5837 Outside U.S. 


SOFTWARE 


187 E. Wilbur Rd., Suite 4, Thousand Oaks, CA 91360 
Makers of Stony Brook Professional Modula-2 and QuickMod 


Buy directly from us and pay only: 


$295 


plus shipping (and tax if applicable), 
U.S., Canada, Mexico 


$375 


includes shipping, 
outside U.S., Canada, Mexico 





Turbo Pascal and Turbo C are registered trademarks of Borland International, Inc. Microsoft and Windows are registered trademarks of the Microsoft Corporation. 
Stony Brook Software and Pascal + are trademarks of Gogesch Micro Systems, Inc. 
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PROGRAMMER'S WORKBENCH 


Listing One (7ext begins on page 110.) 


/* Code Example 1: loop parallelism sum reduction 


* critical block within loop nest 
* B. E. Bauer 1992 

* compiled using the following on the IRIS: 

* parallel: cc -02 -o examplelp -mp examplel.c 
* serial: cc -02 -o example1 examplel.c 


#/ 

#include <stdio.h> 

#define MAX 1536 

double a[MAX] [MAX], b [MAX] [MAX]; 


main() 

{ 
int 1,39 K. ks 
double sum=0.0, temp; 


/* shared arrays filled; values not important, just order */ 
for (i=@; i<MAX; i++) { 
for (j=0; j<MAX; j++) { 
ali] [j] = (double)i; 
bli] [j] = (double) j; 
} 
} 
k1 = 256; /* inner loop delay factor */ 


/* start of parallel region */ 
#pragma parallel shared(a,b,sum) local(i,j,k,temp) byvalue(k1) 


/* pfor block */ 
#pragma pfor iterate(i=0; MAX; 1) 
for (i=@; i<MAX; i++) { 
for (j=@; j<MAX; jt+) { 
temp = ali] [j]-bli][j]; 
for (k=@; k<k1; k++) /* lengthen execution time */ 
temp t= @.@1; 
#pragma critical 


sum += temp; /* summation in inner loop */ 


} 
} 
} 
} 
printf("\nDone! sum = %24.15f\n", sum) ; 
) 
® e 
End Listing One 
e e 
Listing Two 
/* Code Example 2: loop parallelism sum reduction 
* critical block moved into local code 
* 
* B. E. Bauer 1992 
* 
* compiled using the following on the IRIS: 
* parallel: cc -02 -o example2p -mp example2.c 
* serial: cc -02 -o example2 example2.c 
*/ 


#include <stdio.h> 
#define MAX 1536 
double a[MAX] (MAX], b [MAX] [MAX]; 


main() 
if 
int 2, J; kk ’ 
double sum=0.0, temp, st; /* st = local code subtotal */ 


/* shared arrays filled; values not important, just order */ 
for (i=@; i<MAX; itt) { 
for (j=; j<MAX; jt+) { 
ali] [j] (double) i; 
bli] [j] = (double) j; 


} 
} 
k1 = 256; /* inner loop delay factor */ 


/* start of parallel region */ 
#pragma parallel shared(a,b,sum) local(i,j,k,temp,st) byvalue(k1) 
{ 


st = @.0: /* st initialized in each thread */ 


/* pfor block */ 
#pragma pfor iterate(i=0; MAX; 1) 
for (i=@; i<MAX; it+) { 
for (j=@; j<MAX; j+t) { 
temp = a[i] [j]-blil [3]; 
for (k=O; k<k1; k++) /* lengthens execution time */ 
temp t= @.@1; 
st += temp; 


} 


/* local code grand summation */ 
#pragma critical 


sum += st; 


} 


printf("\nDone! sum = %24.15f\n", sum) ; 


Listing Three 


/* example 3: block parallelism, producer-consumer problem 


* B, E. Bauer 1992 
* Taken from "Practical Parallel Programming" pp 214-225 


* (C) 1992 Academic Press, Inc. All Rights Reserved. Used with permission. 


x/ 
#include <stdio.h> 


#define MAXCONSUMERS 3 

#define MAXPRODUCTS 1800 
#define STOP_TOKEN -1 

#define NULL_ITEM (node_t *)NULL 


/* the product is defined here */ 
struct node 
{ 
int productid; 
struct node *next; 
ve 
typedef struct node node_t; /* the typedef is convenient */ 


node_t *root = NULL_ITEM; /* linked list root */ 


int max_depth = @;. 


/* ----- create a new "product" -------------------------------- */ 
node_t *make_product(node_t *item, int prodid) 
{ 


item = (node_t *)malloc(sizeof(node_t)); 
if (item != NULL_ITEM) { 
item->productid = prodid; 
item->next = NULL_ITEM; 


} 
else 
printf("problems with production, boss...\n"); 
return (item); 
} 
[* ----- add a "product" to the end of the list ------------- --- %/ 
node_t *ship_product(node_t *new) 
{ 
int cur_depth = Q; 
node_t *list = root; 
if (root == NULL_ITEM) 
root = new; 
else 
{ 
while (list->next != NULL_ITEM) 
{ 
list = list->next; 
cur_depth++; 
} 
cur_depth++; 
list->next = new; 
} 
if (cur_depth > max_depth) 
max_depth = cur_depth; 
} 
/* ----- pop the product off the beginning of the list --------- */ 
node_t *receive_product () 
{ 
node_t *item = root; 
if (root != NULL_ITEM) 
root = root->next; 
return(item) ; 
} 
[% -<--- consume the product by freeing its memory ------------- */ 
void consume_product(node_t *item) 
{ 
if (item != NULL_ITEM) 
free (item); 
} 
/* ----- the producer s<$s-<-<—-seeeeseh em eecsensessessesseans=s %/ 
void producer (int cnt) 
{ 
node_t *temp; 
int i; 
/* make "products" cnt times (limits simulation) */ 
for (i=@; i<=cnt; i++) 
{ 
if ((temp = make_product(temp,i)) == NULL_ITEM) 
break; 
#pragma critical 
ship_product (temp) ; 
} 
/* load on stop tokens */ 
for (i=@; i<MAXCONSUMERS; i++) 
{ 
if ((temp = make_product (temp, STOP_TOKEN)) == NULL_ITEM) 
break; 
#pragma critical 
ship_product (temp) ; 
} 
printf("producer calls it quits\n"); 
} 


End Listing Two 
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Writing a Windows application is never simple. There 
are, of course, tools around that make life easier for 
you. The compiler makers give you fancy dialog 
boxes and buttons; there are case tools which will 
build the entire program outline, but there comes a 
time when YOU must fill the gap between the program 
and the data. At Highland, we understand the 
problem of Windows programming, which is why we 
have produced the Highland Software Engine. 


HIGHLAND’S SOFTWARE ENGINE 

The Software Engine is a collection of Dynamic Link 
Libraries which are all accessed via a central Kernel. 
With the Software Engine you can add support for 
imaging, database and communications, speeding the 
development of your application. 


WinComm’s Cost £260.00 Source £380.00 
WinComm allows you to interface through the 
serial channel or to interface directly with a Hayes 
compatible modem. The serial channel support 
allows you to initialize a serial channel and to do 
character, string, or buffered I\O through the serial 
port. The modem support allows you to perform all 
of the higher level commands with the modem 
such as autoanswer, dial, etc. WinComm supports 
XMODEM, YMODEM, KERMIT and ASCI! 
communication protocols. 


WinBase Ill+ Cost £99.00 Source £199.00 
WinBase III+ allows you to create, access and 
update files that are 

compatible with the Ashton-Tate’s dBASE III 

PLUS database management system. You do not 
actually need to own a dBASE package in order to 
use this product. WinBAse Ill+ provides you with 
an Indexed Sequential Access Method (ISAM) 
facility that can be used to develop a relational 
database management system just like the popular 


dBASE products. With this ability, WinBase III+ 
gives you an alternative to programming in the 
dBASE interpretive language. 


Winlmage Cost £199.00 Source £995.00 
Winlmage allows you to display and print virtually 
any type of image from your windows application. 
Winlmage currently supports PCX, TIFF, Adobe 
Encapsulated Postscript, Windows Metafile and 
Bitmap image formats. 


Winlmage FX Cost £265.00 Source £995.00 
With additional multi media functions including 
animation and special effects. 


FEATURES: Built-in debugger <> Built-in help system 
~ Simplified Hyper functions <> consistent interface 
for each library <> Each library available separately 
< No run-time royalties <> Source code available <> 


Highland Grafix Ltd., 18 Albion Way, East Kilbride 
G75 OYN, United Kingdom. 
Tel: 03552 64888, Fax: 03552 64777 


Highland Inc., PO BOX 640, 190 Walnut Tree Hill, 
Sandy Hook, CT 06482. United States 
Tel/Fax (203) 426 6773 
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Distributed Software Configuration Management 
Operates Seamlessly Across Multiple Platforms 


Developing and enhancing 
applications today is challenging and 
exciting. Connecting smart PC and 
Workstation interfaces to host systems 
yields applications that were just 
dreams a few years ago. But control- 
ling development across multiple 
platforms can turn the dream to a 
nightmare. 

PVCS is the answer—software 
configuration tools that work in all 
popular environments. PVCS promotes 
code reuse, eliminates module and 
build errors and enables reconstruction 
of all previous versions at the module or 
system level. PVCS provides the 
control and audit that frees program- 
mers to fully exploit the productive 
nature of distributed development 
without hazard. 

PVCS operates seamlessly across 
popular UNIX platforms as well as MS- 
DOS and OS/2. PVCS also synchro- 
nizes to mainframe-based CM systems 
through PVCS Production Gateway. 

Getting started with the new PVCS 
Version 5.0 1s easy. You can start using 
PVCS at any point—even with projects 
that are underway. And if you are using 
SCCS or RCS, our importer can 
upgrade your existing archives. 


NEW 5.0 Features Include: 

— = Windows and PM Graphical User 
Interfaces—speed and simplify adoption 
even by junior programmers and 
administrative users. 

@ Application Programming Interface 
(API)—integrates PVCS features into 
your development environment (or even 
into your applications). 

BH New Multt-Locking Techniques— 
enhance parallel and workgroup 
development. 

BH = Definable Promotion Models— 
define and enforce a structure for the 
developmental progression of your 
projects. 

= Footprinting Support—an explicit 
list of the elements that make up an 
object (as well as other information you 
define) can be embedded within that 


module. The footprint is checked at link | 


time to ensure dependencies are 
satisfied, or can be read with a 
standalone utility to reveal what 
revisions of modules make up an 
application. | 

BM Performance Enhancements— 
Improved across the board. 

HB National Language Support (NLS) 
and Double-Byte Character Support 
(DBCS). 


1992© INTERSOLYV, Inc. All rights reserved. INTERSOLYV, APS, Excelerator, Design Recovery, and PVCS are trademarks of INTERSOLY, Inc. IBM 
and AD/Cycle are registered trademarks of International Business Machines Corporation. All other products are trademarks and/or registered 


trademarks of their respective companies. 
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= Improved and Simplified Access 
Control—Security and access is prima- 
rily controlled by a central database. 
Fine-grained file and function security is 
also provided. 


For information on INTERSOLV 
products, call: 

1-800-547-4000, in North America 
(44) 727 812 812, outside North 
America 


YES! Send me a copy 


| of "Comprehensive Software 
Configuration Management" 








Address 





City, State, ZIP 








| Company Phone 
| Send care of Special aay INTERSOLY, INC. 


INTERSOLV 


Desktop Development Tools for the 90s 








PROGRAMMER'S WORKBENCH 


Listing Three (Listing continued, text begins on page 110.) 


the consumer (created 3 times) 
long consumer (int myid) 


/* local variables */ 
int consuming = 1, j; 
long consumed = Q; /* count of "products" */ 
node_t *item; 
double temp; 


printf("consumer %d starts the shopping day\n", myid); 
while (consuming) /* loop until stop token seen */ 


for (j=0; j<32000; j++) 
temp *= (double) j; 


#pragma critical 


item = receive_product (); 
} 
if (item != NULL_ITEM) 
{ 


if (item->productid != STOP_TOKEN) 
{ 


if (item->productid) 
++consumed ; 
printf("consumer %d consuming product %d\n", 
myid, item->productid) ; 
} 
else 
{ 
consuming = @; 
printf("consumer %d ran out of products\n", 
myid) ; 
} 
consume_product (item) ; 
} 
} 
printf("consumer %d consumed %d products\n", myid, consumed) ; 
return (consumed) ; 


main () 
{ 
long total, sum[MAXCONSUMERS] ; 
int i; 
printf("business starts with %d products\n",MAXPRODUCTS) ; 
#pragma parallel shared (sum) 
{ 
#pragma independent 
producer (MAXPRODUCTS); /* start producer */ 
#pragma independent 
sum(@] = consumer(1); /* start consumer 1 */ 
} 
#pragma independent 
sum[1] = consumer(2); /* start consumer 
#pragma independent 
sum(2] = consumer(3); /* start consumer 
} 
} 
for (i=, total = OL; i<MAXCONSUMERS; it+) 
total += sum[i]; /* sum up the products consumed by each */ 
printf("business over, %d products exchanged\n", total); 


printf("maximum products in list = %d\n", 
max_depth-MAXCONSUMERS) ; 


End Listings 


C Language 


#13 


Given all the confusion surrounding null pointers, wouldn’t it be 
easier simply to require them to be represented internally by 0s? 


What would such a requirement really accomplish? 


#14 


Seriously, have any actual machines really used nonzero null 
pointers? 


Machines manufactured by Prime and by Honeywell-Bull, as 
well as Symbolics Lisp Machines, have done so. 


Copyright © 1992 Steve Summit 





Dr. Dobb's Journal, August 1992 















SKOMAED KS 
| Dpakeabaerey@ nates 


Well oo... 
help Master of 


Software 
you. Engineering 


master Camegie 


Mellon 
University 


Carnegie Mellon University 
offers an academic program 
leading to a professional 
Master of Software 
Engineering (MSE) degree. 


For information contact— 


MSE Admissions Coordinator 
Software Engineering Institute 
Carnegie Mellon University 
Pittsburgh, PA 15213-3890 


(412) 268-7713 





Latest and greatest QNX4 has it all! 


POSIX compliant with Arcnet and Ethernet support, 
QNX Windows, RunDOS and TCP/IP 









Also available for QNX4: 
@ Real-time Process Control & Business Graphics 
® Real-time Spread Sheets 

@ Hypertext 

@ DataBase Products 

@ Edit 2000 

@ FlexeLint 

®@ Macro Assembler 

@ EZ Menu & EZ File Manager 

® QMaster File Manager 

© C Toolkit Plus 

@ QNX-DOS Connectivity: 

DynaDOS 

DOSNx 


















Call or fax for new products & catalog. 
Also available: QNX2 and associated products. 






T&T Computer Products, Inc. 
QNX Specialists 
P.O. Box 14010, Tulsa, OK 74159 
Tel: 918/663-1879 FAX: 918/663-4054 
VISA/MC 


QNX is a registered trademark of Quantum Software Systems, Ltd. 
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SEPTEMBER 21-25, 1992 


CONFERENCE & EXPOSITION 
mf tn SANTA CLARA, CALIFORNIA 
| 


with Object-Oriented 
Analysis & Design 






ad i ——) —1 > 1 Be => 4 


eRONw RO) x 


UNIVERSITY 


CORPORATE EDUCATION CENTER 





To register for C++ in Action, 
or to receive a FREE brochure, call 
508/649-4200, 
or fax your inquiry to 
508/649-2162. 
m 


For more information o 
exhibiting, call 


800-733-3593, ext. 261, 


or fax your inquiry to 


508/649-6926. 


PROGRAMMING PARADIGMS 


The Living Room and 
Other Markets 


pple is starting a consumer-prod- 

ucts division. Apple is not well 

known in this market, and tra- 

ditional consumer-goods vendors 
might say that this market is not well 
known at Apple. Perhaps a word of ad- 
vice would be appropriate. Perhaps it 
would even be appreciated. 

Rumor has it that one can see, taped 
to a wall here and there in Apple’s Cu- 
pertino quarters, copies of a letter from 
Stan Cornyn, president of Warner New 
Media. Excited by the prospect of Ap- 
ple getting into consumer products, 
Cornyn had written to share his 
thoughts on how Apple could avoid 
getting Betamaxed out of the market. 
I here reproduce the letter, which 
Cornyn doesn’t mind the world seeing, 
in full. 


21 Rules for Apple Consumer Electronics 
Once you make a box, don’t change 
the way it works for at least ten years. 

2. Start from cheap boxes and progress 
to expensive ones, as the Japanese 
automobile industry has done. 

3. Find an exposure method as power- 
ful as top 40 radio and video arcades 
to show what your box can do. 

4. Hire a vice-president of CD-ROM 
sales. Fund the position. Insist that 


Michael Swaine 


the VP sell 500,000 CD-ROM drives 
in 1992, no matter what it takes. 

5. Lowering the price of your box is 
not enough. You must also increase 
its value for all to see. 

6. Once your CD-ROM drives have 
been sold, help publishers learn the 
addresses of buyers. Otherwise publi- 
shers can’t target their marketing ef- 
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forts and will begin to sleep around. 

7. Real consumers do not want tools. 
They want the product of tools. 

8. Forego early profits for long-term mar- 
ket share, and don’t bitch about it. 

9. Court mass publishers. If they can 
sell a million copies, you have sold 
a million boxes. 

10. Stop selling to school administrators. 
Sell to students. 

11. Make your box part of the home-in- 
formation system. Make portable 
boxes that can connect with tele- 
phones, stereos, and cable. 

12. Present your CD-ROM players to the 
public as Ultra CD audio players that 
can do extra things. That way peo- 
ple will catch on faster. 

13. Fifty titles is not enough. Two thou- 
sand is more like it. Devise a plan 
to obtain 2,000 titles fast. 

14. Color pictures look better than black 
and white. 

15. Having better-looking pictures isn’t 
enough. Remember Betamax? More 
titles is the answer. 

16. Can you sell this box at Wal-Mart? 
If not, rethink it. Maybe paint it pink. 

17. Whoever makes a box that easily 
converts to digital video is the win- 
ner. Whoever doesn’t is planning ob- 
solescence. 

18. CD systems using TV screens will 
always be cheaper. Emphasize the 
advantages of monitor display. 

19. Don’t call it a computer. Call it, for 
instance, a Time Machine. Not P3- 
TV. 4D-TV is a better choice. 

20. Start product development with a 
breakthrough marketing campaign. 
Force your engineers to catch up 
with you. 

21. Careful market research results in 
one sure thing: You will be late to 
market. 





Divide and Conquer 

The story that this list can be seen taped 
to the walls at Apple is not surprising. 
Apple seems to be very serious about 
the consumer market, recognizing, as 
its legions of unsolicited advisors would 
like it to recognize, that the consumer 
market is not just another niche, not 
even just another market in the sense 
that video professionals are another mar- 
ket, but something quite different from 
anything Apple is used to. Apple is go- 
ing to go after the consumer market via 
a new, separate division of the compa- 
ny. That’s a promising sign, although 
the initial forays into the consumer mar- 
ket look suspiciously like spin-offs or 
repackagings of existing computer-di- 
vision products and technologies. 

In any case, along with the spin-off 
companies Kaleida and Taligent, it looks 
like Apple’s new strategy is divide and 
conquer, in the amoebic, rather than the 
Napoleonic, sense. 

Here’s a mid-year summary of devel- 
opments in three areas holding a lot of 
Apple’s attention lately: multimedia tech- 
nology, the home and/or consumer mar- 
ket, and its more traditional hardware 
and operating-system efforts. 

Kaleida, the multimedia company, has 
a CEO: Nat Goldhaber. Theoretically, 
the company was launched last Octo- 
ber and apparently some not-so-theo- 
retical work was getting done under its 
nameplate, but it’s hard to take a com- 
pany seriously until it has a CEO. Gold- 
haber has been bouncing around the 
Mac universe for a while; he was one 
of the founders of Centram, the devel- 
oper of TOPS. 

QuickTime has caught on. One Ap- 
ple competitor, Silicon Graphics, is re- 
portedly negotiating to license it, with 
the fairly obvious intent of using it to 
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make SGI’s low-end Indigo (described 
here in February 1992) even more com- 
petitive with Apple’s high-end Quadra 
line for video professionals. Currently, 
the Indigo is clearly a better deal than 
a Quadra on a hardware basis, but SGI 
has nothing like Apple’s third-party soft- 
ware base. SGI machines, however, 
have very impressive video-editing and 
video-production capabilities, a point 
not lost on the some 300,000 video pro- 
duction professionals in the U.S., among 
whom SGI is highly respected. That in- 
cludes the Hollywood types: SGI hard- 
ware was used in producing the special 
effects for Terminator IL SGI is also re- 
portedly negotiating for new Apple 
imaging model technology. 

Apple, eyeing those 300,000 video 
professionals and the future in-house 
video professionals in business and the 
potential VCR home-movie producers, 
would like to get some of that Holly- 
wood credibility. The theory is that re- 
spect trickles down. If that’s the theo- 
ry, then Apple’s visibility at this year’s 
National Association of Broadcasters 
conference didn’t hurt. Not only was 
Apple there in force, but there were al- 
so a lot of third-party products that 
showed off the Mac’s capabilities for 
editing video, teleprompting, and com- 
municating with various entrenched for- 
mats and devices. 


The Brave Little Toaster Division 

Apple’s PDAs, or personal digital assis- 
tants, are actually going to be called 
Apple Intelligent Assistants, or at least 
the first ones are. These handheld 
things, due out early next year but pre- 
viewed at CEBIT in Germany and CES 


_miniEd Editing Tools 
The maximum answer for add-in editing 
Now with 'C' or Assembler source code 


The best cross platform, cross language editing tools just got 
better. If you need to integrate text editing functions into your 
applications, the _miniEd family of editing products provides 
the solution. Handle document entry, pop-up windows, memo 
fields or any other formatted text requirement with a simple 
call to a __miniEd function. 
Cross Platform: DOS / WINDOWS / UNIX 
Cross Language: 'C'/ BASIC / PASCAL / ASM / MODULA 2 
Features: "Format-on-the-fly" word wrap technology 
All standard editing functions supported 
VIEW mode for formatted text display 
Works with any size screen window 
Full source code provided 
_miniEd - RAM based editing in a 5K package 
_miniEd Toolkit - File or RAM based editing 
NEW _miniWords-150000 words & support routines 
In May _miniSpell - add-in spell checking 
At SSDI, we don't take the word “mini” lightly. All __miniEd 
tools are compact, tightly structured modules that won't 
impose a size or performance overhead on your applications. 
The tools are priced from $99.00 to $349.00. To order or 
obtain more information call (708/778-6060) or FAX (708-778- 
6063). VISA and MC accepted. Please specify 'C’ or 
assembler source when ordering. 
Strategic Software Designs, Inc. 
6S$235 Steeple Run Drive - Sulte 12B - Naperville, IL 60540 
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in the U.S., are not really consumer 
products, but they do show Apple tak- 
ing a step in the direction of toasteri- 
zation. The original Mac, which many 
people thought looked like a toaster, 
was supposed to be an appliance. It 
wasn't, and subsequent tweaks moved 
it farther and farther from Toasterland, 
jumping the tracks at Computerville 
Junction. PDAs (aka AIAs) look like the 
first step Apple has made back in that 
direction since 1984. 

Here’s what the weeklies think the 
first AIA will be like: Due to ship in Jan- 


It looks like Apple’s 
new strategy 1s 
divide and conquer, 
in the amoebic, 
rather than the 
Napoleonic, sense 


uary 1993. Price under $700 and falling. 
RISC-based and faster than a Mac IIfx, 
hardware by Sharp, multitasking OS by 
Apple. OS to be licensed to other man- 
ufacturers. Pen-based, with no boxes to 
fit letters into, no training required (or 
allowed?), and no visible file system. In- 
frared link, serial port, able to dock to 
a Mac and maybe to a PC running Win- 
dows. Titles on SRAM cards being de- 
veloped by third parties like Random 
House. 

Apple’s approach to these things ap- 
parently didn’t sit well with Bill Atkin- 
son when he was still with the compa- 
ny, and was reportedly the reason for 
his founding General Magic with Andy 
Hertzfeld and Marc Porat, where they 
have been pursuing the approach they 
think ought to be taken to handheld 
computing devices. Although they’re 
very secretive and deny all published 
reports about what they’re doing, their 
idea apparently has to do with an oper- 
ating system for devices that communi- 
cate with other devices: cellular phones, 
handheld computers, and so on. Rather 
than producing a product and putting 
their label on it, they are pursuing li- 
censing agreements. How this fits with 
Apple’s efforts is unclear, but Apple 
(along with Sony and Motorola) is a big 
investor in General Magic. 

Meanwhile, Apple is working on the 
multimedia Macs that it expects to put 
on the shelves for Christmas. The Ap- 


ple version of the MPC, apparently. Per- 
haps as evidence that it is trying to learn 
how to address this market, the com- 
pany recently: 1. Put together a compu- 
ter+ printer + software bundle for under 
$1000 (the OS and bundled software are 
on ROM), 2. has been evangelizing 
heavily for content providers for its mul- 
timedia Macs; and 3. brought out new, 
more powerful versions of its machines 
at prices lower than those of the ma- 
chines they replaced. Apple is trying, 
folks. 


They Also Serve 

And on the traditional Apple front, try- 
ing to convince corporate buyers that 
the Mac is not a toy, Apple has been 
hearing from other advisors. Consultants 
recently presented Apple with a list of 
priorities, high among which was, build 
us a dedicated server. Mac managers 
generally look elsewhere than Apple for 
server hardware and software, but Ap- 
ple hopes to change that with its dedi- 
cated server, expected to ship in Jan- 
uary 1993. It’s a 68040 box without 
24-bit color and other frills, and with 
A/UX as its native OS, but with support 
for Mac System 7 and System 6. 

Oh, yes, A/UX. Apple’s version of 
UNIX. One of the many Apple operat- 
ing systems. Apple will be spending 
more time in the future supporting var- 
ious operating systems, apparently. The 
Mac OS is getting reengineered and fit- 
ted with a UNIX-like kernel, and should 
be around for a while. The operating 
system being developed for the Pow- 
erPC architecture by the Taligent joint 
venture with IBM is being described as 
addressing a different market, although 
who knows what the story will be when 
it actually arrives. 

When that day comes, a product from 
a Bell Labs spin-off, Echo Logic, should 
come in handy. FlashPort lets you port 
Mac applications to the PowerPC in 
days. This is not to be confused with 
the emulation option: The new oper- 
ating system is supposed to run Mac 
software (and non-Mac software, too) 
at native 680x0 speeds via emulation 
modules called “personalities.” Flash- 
Port is an option between emulation 
and total reengineering: It involves a 
multistage analysis of program logic 
achieved through examining registers 
and is supposed to produce 90 percent 
of the performance of code written for 
the PowerPC. It will probably be very 
expensive, but Echo Logic is consider- 
ing setting up porting centers where you 
can buy a port. 

Apple, meanwhile, is making inter- 
esting moves toward another operating 
environment. QuickTime is not the on- 
ly piece of Mac system software being 
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ported to Windows; there are rumors of 
an Intel version of the Mac OS. Perhaps 
such rumors make Microsoft nervous; 
perhaps that’s what they’re for. Mean- 
while, though, Apple’s software sub- 
sidiary, Claris, is definitely porting its 
products to Windows, starting with a 
well-received port of FileMaker. 

That’s all strange enough, without 
NeXT negotiating something or other 
with Microsoft. 


Is This the Pro Shop? 
I want to wrap this up with a look at 
one Mac product, LinksWare. 

LinksWare. Sounds like golfer garb, 
doesn’t it? Little alligators on the shirts, 
sun visors, bad color combinations. Ac- 
tually it’s both a company and a prod- 
uct. The company is LinksWare Corp., 
812 19th Street, Pacific Grove, CA 93950; 
408-372-4155. The product is a clever 
idea, and that’s chiefly what I want to 
present here: the idea of LinksWare. 

Let me underscore that. This is de- 
cidedly not a review of the product, 
which I have yet to get to work quite 
as advertised on my system. Those last 
three words are significant, and could 
be the whole explanation; my Mac set- 
up is a little nonstandard, and LinksWare 
is a small company with limited re- 
sources. The many-hatted LinksWare au- 
thor Tracy Valleau shouldn’t spend those 
resources On accommodating weird 
hardware and software setups. If it looks 
like an actual review of LinksWare is 
warranted, I'll test it on another machine. 

Although LinksWare the company is 
small, LinksWare the product, or at least 
the idea behind LinksWare the product, 
is big. It's Ted Nelson’s and Doug Engel- 
bart’s and Vannevar Bush’s idea of hy- 
pertext. LinksWare is a tool for creating 
hypertext and hypermedia links between 
files. But it’s a cleaner implementation, 
conceptually, than many I’ve seen. 

LinksWare is not a separate environ- 
ment like HyperCard. It uses files cre- 
ated by all the major Macintosh word 
processors and painting and drawing 
programs, and it can also create links 
to video segments recorded in Apple’s 
QuickTime movie format. It doesn’t 
modify the original files, so it can link 
to files on CD-ROM. It doesn’t require 
any rekeying of data. It also doesn’t re- 
quire any programming or HyperCard- 
like manipulations. 

It makes a clear distinction between 
the developer and the consumer of hy- 
pertext/hypermedia products. LinksWare, 
the developer version, costs $189 and 
there are no royalties on documents pro- 
duced (or rather, linked) with it, nor any 
limitations on how you can distribute 
your linked hypertext. The supplied 
LinksWare Reader, which takes up 203K 
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on a disk, uncompressed, is basically a 
full version of the product except that it 
does not allow editing or adding of links. 
LinksWare Reader can be distributed 
freely, including being bundled with 
your hypermedia documents. 

LinksWare doesn’t read every file for- 
mat. It depends on Claris’s XTND and 
DataViz file-conversion technology to 
read files produced by other programs. 
XTND works via translators kept in a 
folder. LinksWare is sold with many of 
the more common translators, but more 
can be added by dropping them in the 
folder. Translators are typically provid- 
ed by interested vendors free of charge 
and distributed on electronic services. 
Many Mac products, particularly Claris 
products, come with a folder of trans- 
lators. 

Here’s the short story on how it 
works, from the developer’s viewpoint: 
To create a link using LinksWare, you 
click on a word or select an area of a 
graphic, then click on the file you want 
linked. In the current version, only 
words can be used as links in text files, 
but multiword links are planned for the 
next revision. LinksWare lets the hy- 
pertext author decide on how links 
should be shown to the reader: with 
italics, underlining, or whatever. 

And here’s how it works, from the 
customer’s viewpoint: When perusing 
the hypertext created by LinksWare, the 
reader notices that a word is a link and 
clicks on it. The associated file is opened 
and displayed. A keypress dismisses it. 

The full story, especially from the de- 
veloper’s viewpoint, is a little longer. 
LinksWare has several utilities for main- 
taining links: There is a file-finding rou- 
tine that resurrects links when the files 
have been moved or renamed, and 
menus give direct access to link words 
and linked files. A hypertext document 
can span up to 795 separate files, with 
up to 127 links from each text file or 64 
links from a graphic. There is a limit of 
6000 link words and 10,000 links alto- 
gether. 

That’s the description of the product. 
Not a terribly complicated idea, is it? And 
yet, conceptually, it’s closer to real hy- 
pertext than most of the products that 
are called hyper- something or other. If 
only it were built into the operating sys- 
tem; but no, not the operating system, 
because it ought to be machine-inde- 
pendent. If only it were—universal. 

Like Xanadu, for which we are still 
waiting, Ted. 


DDJ 
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is complete! Now create 32-bit C or Ctt applications 
for Windows or Extended-DOS, securing your future 
for Win32s and NT. Existing Windows C code can be 
recompiled to produce true 32-bit applications. And 
now we're busy shipping High C/C++ to those of you 
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C PROGRAMMING 


A Pensive Look at 
Pens and C++ 


his month is my fourth anniver- 

sary as DDJ/’s “C Programming” 

columnist, and it is also the an- 

nual C issue. For this special time, 
I’m going to suspend work on D-Flat 
and talk about some C++ issues and a 
potential pen-based D-Flat library. Both 
subjects are forward-looking, I hope, 
and appropriate for this issue. Next 
month wraps up the continuing saga of 
D-Flat. After that, we'll start looking at 
D-Flat++, the rewrite of the CUA inter- 
face library in C++. 

After four years of almost exclusive 
coverage of C-language programming 
in this column, I am thinking that the 
growing popularity of C++ needs more 
attention from this forum. I have been 
using C++ in my own development for 
quite some time and am happy with 
and confident in my personal prefer- 
ence for it. The time I devote to D- 
Flat++ and your reaction to it will chart 
the course of this column for the fu- 
ture. If you are predisposed to resist, I 
ask that you bear with me. Let me 
show through the comparison of D-Flat 
and D-Flat++ how the notational im- 
provements of C++ improve the craft. 
I might not be able to demonstrate it 
in this column, but I have learned, too, 


Al Stevens 


that a design process that concentrates 
on the objects first rather than the pro- 
cedures is more intuitive and produces 
a sounder and more maintainable soft- 
ware system. This is not news. Others 
have been saying this for years. But 
now there are C++ compilers at every 
level, and the rest of us can find out 
for ourselves. 
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PenD-Flat 

Pen-based computers are here for a 
while, and we should be aware of what 
that means to software development. A 
pen-based development environment 
will resemble one for an embedded sys- 
tem, where you develop on something 
other than the target hardware. No one 
would want to squint and scratch their 
way through a small clipboard-sized, 
handheld, compile-and-debug session 
with Borland C++, for example. I guess 
you could do your programming while 
you ride the exercise bike, lounge at 
the beach, or take that nature walk, but 
you'd soon lose patience as you grap- 
pled with the pen and pad to change 
code, set breakpoints, and look at vari- 
ables. Besides, not many of the little crit- 
ters will have the megabytes of disk and 
RAM to install the new breed of gar- 
gantuan compilers. No, the pen-based 
platform is not ideal for software de- 
velopment. You still need all the tools 
that you need for traditional program- 
ming, but you also need something 
hooked to the AT that simulates the pen 
and pad. 

This should give you a clue about 
what kind of applications are best 
served by pen-based software and what 
kinds are not. The notion that pen com- 
puters will proliferate in executive of- 
fices in the hands of keyboard-shy 
CEOs is silly. Today’s executives might 
be less than computer literate, but that 
is changing. Before pen-based technol- 
ogy is good enough for serious word 
processing and spreadsheets, it will be 
overtaken by two things—voice-based 
computing and a new generation of 
technology-sawvy executives. Therefore, 
do not plan to launch a pen-based word 
processor product. No one will buy it. 





Pen computing will wear a blue collar 
and will embrace vertical applications. 
It will realize its potential in walk-around 
applications where mobility and free- 
dom from cabling are important and 
where there are some data to be en- 
tered that cannot be scanned. The UPS 
person in my hometown uses a pen 
tablet to record deliveries and recipient 
signatures. It’s an experimental program. 
In some cities, you can already get a 
traffic ticket written and printed by a 
pen-based computer— RoboCop. Com- 
bine a pen with a bar-code reader, and 
you have an excellent warehouse-inven- 
tory input device. Scan the inventory 
number’s bar code and write the quan- 
tity-on-hand. Aluminum siding, screen 
enclosure, driveway paving, and roof- 
ing salespeople could use small pen- 
based CAD applications to design and 
cost home improvements while sitting 
in the customer’s living room. Vertical 
applications every one. Opportunities 
for programmers. 

Most pen-based systems have graph- 
ical operating environments at their 
foundation, and you need a GUI de- 
velopment environment. Microsoft Pen- 
Windows is one example. PenDOS, 
from Communications Intelligence Cor- 
poration (Redwood City, California), is 
a pen-based operating environment that 
runs in text mode on top of MS-DOS. 
When used along with a cooperating 
digitizing tablet and pen, PenDOS pro- 
vides a pen interface to text-mode DOS 
applications. Typical DOS applications 
need little or no modification to run with 
PenDOS. Most mouse and keyboard op- 
erations are handled by the environ- 
ment as if you entered them with tra- 
ditional devices. PenDOS emulates 
mouse actions by the movement and 
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tapping of the pen on the screen’s sur- 
face. It emulates keyboard input by pop- 
ping up a Writing Window into which 
you manually write text with the pen. 
The environment uses character-recog- 
nition algorithms to translate your scrib- 
bles into keyboard characters. Your en- 
tries are displayed in a text-entry 
window, and when you like what 
you've written on a line, you tap the 
Send button. PenDOS stuffs the char- 
acters into the application as if you had 
typed them. It works well, recognizing 
a variety of handwriting styles and re- 
quiring little practice to become com- 
fortable with its use. 

Even a pen-based application needs 
a user interface. I was not sure that the 
interfaces that work with the keyboard, 
screen, and mouse would be meaning- 
ful on the pen/tablet platform, so I de- 
cided to use D-Flat and PenDOS to ex- 
periment with a pen-based CUA 
interface. In theory, a D-Flat application 
would run under PenDOS with little or 
no trouble. This combination would of- 
fer the developers of pen-based appli- 
cations a CUA interface that uses pen 
taps and movements instead of mouse 
actions and hand-written pen text input 
instead of keyboard input. 

I used the Wacom HD-648A Hand- 
write Digitizer to test PenDOS with D- 
Flat. This device has a VGA-compatible 
LCD screen and a serial pen-input de- 
vice. The drivers that come with Pen- 
DOS work with the Wacom. The pen 
emulates a serial mouse. This particular 
Wacom device is intended for develop- 
ers who would test their programs with 
traditional PC hardware. Once de- 
bugged, the applications could be em- 
bedded into specialty pen-based DOS 
computers. The ideal testing configura- 
tion has the Wacom connected to the 
VGA port and the system console as- 
signed to a monochrome monitor—the 
typical two-monitor testing platform. 
That allows you to use the keyboard 
and monochrome screen to debug while 
the Wacom VGA device displays the ap- 
plication screens. 

With PenDOS and the Wacom digi- 
tizer installed, the display satisfactorily 


- emulated the LCD VGA. I tried out 


some DOS commands through the Writ- 
ing Window. It works, but users will 
probably not like entering DOS com- 
mands that way. You'll need to boot in- 
to the embedded application, I think. I 
ran D-Flat’s Memopad example pro- 
gram unmodified under PenDOS. The 
pen properly selected menus, chose 
menu commands, pressed command 
buttons, and selected items from list 
boxes, check boxes, and radio buttons. 
I could move and resize windows eas- 
ily with point-and-drag operations. Dou- 
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ble-clicking is double-tapping with the 
pen. If anything, the CUA mouse op- 
erations are easier and more intuitive 
with a pen pointed directly at a hori- 
zontal screen than they are with a 
mouse on the desk moving a cursor on 
a vertical screen. 

(A hint. If you lose pencils, combs, 
car keys, eyeglasses, and your TV’s re- 
mote control with predictable regulari- 


The gray areas in 
C++ get darker when 
you get into classes 
and inheritance, 
and that is where 
many programmers 
have the most 
trouble 


ty the way I do, then you'd better tie 
the pen to the pad with a strong string. 
Until I did, that pen disappeared at least 
daily and then showed up in the darne- 
dest places.) 

All D-Flat needs to make it a complete 
pen-based user interface is an edit-box 
control that uses handwriting input sim- 
ilar to the PenDOS Writing Window, and 
the PenDOS “gesture” characters for text 
editing with a pen. I have to return the 
borrowed Wacom digitizer, so there is 
not time to do that, but I might get an- 
other crack at it later, perhaps as a part 
of the D-Flat++ project. 


C/C++ in Action 

I attended the C plus C++ in Action con- 
ference in Teaneck, New Jersey during 
the week of April 27. This conference, 
which has a London edition in June and 
one in Santa Clara, California in Septem- 
ber, is organized and operated by the 
Wang Institute of Boston University. Al- 
though it aims at both C and C++, the 
strong emphasis of the conference and 
the attention of the attendees was def- 
initely on C++. I sat on a panel that dis- 
cussed C and C++ as a family of lan- 
guages. Christopher Skelly, the technical 
chairman of the conference was mod- 
erator. The panel included Bjarne Strous- 
trup, Jim Brodie, Dmitry Lenkov, PJ. 
Plauger, Jim Coplien, and me. Bjarne, 
of course, designed C++. Brodie is chair- 
man of the X3J11 ANSI C Standards 
committee. Lenkov is chairman of the 
X3J16 ANSI C++ Standards committee. 


Plauger is a well-known author and 
speaker on C and a member of X3J11 
and its ISO counterpart. Coplien is an 
AT&T researcher and instructor and the 
author of Advanced C++ Programming 
Styles and Idioms. As you might expect, 
this motley panel contained overlapping 
camps with overlapping agenda—lan- 
guage designer, language standardizers, 
authors about language, and language 
users. 

The panel discussed several concerns, 
among them the feeling in the C/C++ 
community that the independent efforts 
to standardize and extend the two lan- 
guages could make them less compat- 
ible than they are now, creating a water- 
shed between C and C++ and polarizing 
rather than assimilating the two groups. 
The panel’s consensus was that C and 
C++ should become no less and no 
more compatible than they are today. 

There was a lot of attention given to 
what might happen with C and C++ in 
the next several years, and less given to 
what programmers in the audience had 
to deal with on the following Monday 
morning when they went back to work. 
The questions from the audience indi- 
cated—to me, at least—that they were 
as interested in the latter issue as in the 
former. The panel’s discussion was di- 
rected more to the former. 

One member of the audience asked 
why, in a session he attended, most of 
the other attendees held up their hands 
when asked who was a C++ program- 
mer, yet few of them admitted to un- 
derstanding or using object-oriented de- 
sign and programming. Wasn't OOP, 
asked the programmer, the primary ad- 
vantage that C++ had over C? And, if 
so, how come so few C++ programmers 
understood or practiced it? 

Most of those programmers have 
learned the notational extensions that 
C++ adds to the C language. They know 
about C++ comments, references, glob- 
al scope resolution, default function ar- 
guments, inline functions, mixing dec- 
larations and procedural statements, 
const, anonymous unions, unnamed 
function parameters, and structs and 
enums as discrete types. And they agree 
that C++ is easy to learn and use—as 
long as you don’t have to design and 
use classes. The gray areas in C++ get 
darker when you get into classes and 
inheritance, and that is where many pro- 
grammers have the most trouble. 

And that, in a nutshell, is the number 
one failure of the C++ evangelism, I be- 
lieve. The most powerful feature of C++ 
is the one that is most difficult to learn 
to use, and we are not moving fast 
enough to correct the learning problem. 
There are real advantages to using C++ 
over C. There are few programming cir- 
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C++ Object Organizer 


Object Manager Is Made-To-Order. 


Use Raima Object Manager class library to 
simplify your C++ code. Store your objects 
(and records) on disk, and retrieve them 
using advanced caching techniques. 


Streamline Your Code For Speed. 
A single C++ statement, 

MyObject [“Mouse’” }]; 
will retrieve an object—fast. Object Manager 
is built on the high-performance Raima Data 
Manager database engine — top-rated for 
performance in benchmark tests by BYTE 
Magazine.* 


Be Persistent. 


Object Manager is a natural extension of 
C++. Storing and retrieving objects 1s as 
easy as inheriting from a C++ class. 


Simplify Complex Applications. 
Object Manager base classes provide 
several powerful data navigation 
methods. You can easily navigate 
between objects and maintain "real 
world" relationships, whether you're 
working on a customer object with a 
complete list of transactions or a bitmap 
with related multimedia data. 


Put Object Manager In Its Place. 
Add Object Manager to your toolkit 
alongside your favorite windowing and 
interface class libraries. 

For more infomation call: 
1-800 ‘DB: state: 206-747-5570 
*BYTE Magazine, January 1992, "Database Building Blocks" 
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(continued from page 134) 
cumstances where C++ will not do a 
better job. One of those circumstances 
is when the programmers do not un- 
derstand the advantages and are not 
sold on giving it a try. It is a public re- 
lations problem as much as anything 
else. Programmers are immunized to 
hype. 

The question was asked, “When 
should you decide to use C and when 
should you decide to use C++?” Most 
of us had seen or heard about projects 
where the decision to use C++, and 
particularly object-oriented design, was 
made because someone in a position 
of influence had been told that C++ is 
not only trendy but a panacea as well. 
In virtually every such instance, the 
project failed. If the problem domain 
is already supported by a mature li- 
brary of C functions, then C is a good 
choice. If the programming staff has 
neither experience with OOP/C++ nor 
the inclination to learn, then C is still 
a good choice. If neither condition ex- 
ists, then C++ is a good choice. But the 
one compelling decision point is the 
bent and the ability of the program- 
mers to use C++, and that is not always 
foregone. | 

To further illustrate, let me retreat to 
an earlier conference session where Jim 
Coplien described some advanced ways 
to make C++ do things that you might 
want it to do. This was an intense, in- 
formative, and entertaining session. Jim’s 
voice and delivery remind me of TV 
news-journalist Michael Kinsley, who 
baits his co-host and guests on “Cross- 
fire” every week, although Jim gets his 
point across without making you want 
to punch him in the nose. I'll discuss 
two of his C++ techniques without try- 
ing to explain how they work. In the 
first place, I might get it wrong. That 
notwithstanding, there is a much better 
source for this information, and I high- 
ly recommend Jim’s book, Advanced 
C++ Programming Styles and Idioms, 
which addresses these two methods and 
much more. I'll discuss the book later 
in this column. 

The first technique, called the “count- 








Example 1: The counted-pointer 
technique. 
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ed pointer,” shares data values between 
the first instantiation of an object and 
any copies you might make through as- 
signment. It overloads the -> operator 
so that you can use it to get at the com- 
mon representation. With the class de- 
fined according to Jim’s specification, 
you can write the code in Example 1. 

Note that although s7 is not a point- 
er, the notation treats it like one. Nev- 
er mind why you would want to do this. 
I’m not sure I would, but Jim gave ex- 
amples of where this approach solves 
a particular problem. 

The second technique laments the 
lack of a virtual constructor in C++ and 
uses what is called the envelope/letter 
idiom to give the effect of virtual con- 
structors. The using program deals on- 
ly with objects of an envelope class, yet 
the objects themselves take on different 
characteristics, depending on the con- 
text in which they are constructed, as 
defined in letter classes. This goes a bit 
farther than simply overloading con- 
structors, because you can add contexts 
later without modifying the envelope 
class, thus giving the effect of a virtual 
constructor mechanism. Example 2 
shows some code that uses objects of 
a class defined with the envelope/letter 
idiom. 

All three variables are of type Num- 
ber, yet each has a different data struc- 
ture and behavior. Once again, don’t 
worry about understanding why a pro- 
grammer would want to do this. The 
point is that you are being told that it 
is possible with C++. 

Now, imagine a discussion between a 
C++ guru and a C programmer who has 
been told that C++ is the future. The gu- 
ru, having attended Jim’s session or read 
Jim’s book, foolishly begins to explain 
some of these methods to the C pro- 
grammer expecting these features to be 
a strong selling point to a C++ rookie. 


Guru: You can overload operators in 
C++. You can even overload the addi- 
tion operator to perform subtraction on 
your numerical classes. 

Programmer: That’s good? 


G: No, that’s bad. It would not be intu- 
itive and would be poor programming 
practice. You might write a program that 
no one else could read. 

P: So overloading operators that way is 
always bad. 

G: No, sometimes it’s good. There are 





times when you want to overload an 
operator in nonintuitive ways. Consid- 
er the counted pointer. Its -> operator 
is overloaded so that you use it to the 
right of an object that is not a pointer. 
P: | don’t understand why one exam- 
ple is good and the other is bad. 


G: Because finding a solution to the 
problem outweighs the need to write 
intuitive code. If it offends you, don’t 
use it. 

P: Then why are you telling me about it? 


G: Because you might need it some day, 
and to show you another less-than- 
obvious way that C++ is extensible— 
how it lets you go beyond the original 
language design. 

P: What are some other ways? 


G: Consider the letter/envelope idiom 
that gives you the effect of virtual con- 
structors. You can instantiate three nu- 
merical types by saying, Number a, b, 
c; and then you can say a=123, and 
a will be an integer. You can say b= 
456.78, and b will be a float. You can 
cast d+b to a complex number, assign 
it to c, and c will be complex. There 
you have three variables, all of type 
Number, and each one has a different 
internal data representation and differ- 
ent behavior, based on the context in 
which you assigned something to it. 
Neat, huh? 

P: (Getting hot) Wait a minute. I could 
do that 15 years ago with Basic and I 
didn’t have to jump through all these 
stupid envelope/letter-class hoops. I 
want to use C++ because it has strong 
typing, and they tell me that’s what I 
need. What happened to all that strong 


typing? 


G: (Smugly) With C++ you have the 
freedom to bypass it. 
P: If I want freedom, I'll stick with C. 


And there, dear hearts, is where you 
lose a convert. This is typical of the hype 
that obscures C++. Bjarne observes that 
programming shops have a lot of rea- 
sons not to switch to C++, and he ef- 
fectively counters each one of them. But 
his message is delivered to a kindred 
audience. The attendees of these con- 
ferences are predisposed to like Bjarne 
and the language that sprang from his 
labors, even when they don’t fully un- 
derstand it. But there is a strong C con- 
stituency out there, and, until recently 
at least, that constituency was growing. 
Many of them don’t live, breathe, and 
sleep programming and every new 
paradigm shift, and they don’t go to the 
conferences to worship at the graven 
C++ image. Those are the folks we need 
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C PROGRAMMING 


(continued from page 136) 
to reach, and they are mainly skeptical. 

Consider what C programmers see 
when they look at source code. Unless 
there are some tricky preprocessor 
macros, what they see is what they get. 
The C source code reveals everything 
the C compiler is doing for the pro- 
grammer. If a function is going to be 
called, the programmer must call it. If 
a variable is going to be declared, the 
programmer must declare it. Not nec- 
essarily so with C++, where a lot goes 
on under the surface. The simple state- 
ment that assigns one object to anoth- 
er can launch a barrage of constructor 
functions and hidden temporary objects. 
Until the programmer is comfortable 
with that sort of semi-organized chaos, 
there will be some furrowing of bead- 
ed brows when he or she steps through 
the code with a debugger. 

Then there are the many hidden fea- 
tures in C++ that you cannot reveal up 
front, lest you scare the pants off the 
programmer the way our C++ guru did 
a few paragraphs back. There is a lot 
to learn after you learn the language. 
First you solo. Then you get your pilot’s 
license. Then you learn how to fly. 
Learning C++ is a process of discovery. 
Teaching C++ is managing that process. 

For example, I want to call a derived 
virtual function from within a base con- 
structor. Why not? It seems like a good 
idea. I’m stepping through the program 
now. How come I’m executing the 
base’s function instead of the overriding 
virtual derived one? It’s right there in 
the code. The compiler knows about it. 
(Slow down. Let it sink in. The base class's 
constructor is running before the derived 
class has been constructed. There is no 
derived virtual function yet.) What’s that 
you say? There is, too. I compiled it my- 
self and it’s in the memory map. Don’t 
tell me it doesn’t exist yet. (Well, ves it 
does, but not in the name of the object 
you are instantiating at this time. Virtu- 
al functions are called through the ob- 
ject’s pointers to the derived class’s virtu- 
al function table, and your object hasn't 
built those pointers yet.) How was I sup- 
posed to know that? (Vow you know.) 

Here’s another example. I'll just call 
a constructor function from another con- 
structor function for the same class. One 
of them does most of the work, so I'll 
let the others do their unique process- 
ing and then call the principal one. 
What’s this? It doesn’t work. The prin- 
cipal constructor never executes. So, I 
step through the code with the debug- 
ger. There’s the constructor. I’m exe- 
cuting it. How come what it is con- 
structing right here before my very eyes 
never gets constructed in my object? 
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destructor function! What am I doing 
here? (Slow down again. When you 
called that constructor function, you in- 
stantiated an unnamed, hidden, tem- 
porary object. That’s why the original 
object does not get constructed properly; 
you are constructing one you never see. 
When you exited from the original con- 
structor function, the temporary object 
went out of scope, and its destructor was 
called.) Why can’t I just call that con- 
structor and get it called the way I want? 
(Because the syntax for calling a con- 
structor function is one of the several 
ways that you instantiate an object.) 

These are just some of the traps and 
pitfalls that await the unwary fledgling 
C++ programmer. Word about them 
leaks out, and programmers get intim- 
idated. You have to discover every one 
of them for yourself. They are neither 
intuitive nor obvious in the language 
syntax, and even when an instructor or 
tutorial book tells you about them, you 
forget until they bite you. Programming 
is supposed to get easier with each new 
shift, not harder. Hah. 


Coplien’s Book 
The literature of C++ and object-orient- 
ed design and programming is improv- 


ing. As promised, here are my reactions 
to James O. Coplien’s Advanced C++ 
Programming Styles and Idioms (Addi- 
son-Wesley, 1992). To begin with, do 
not use this as your first book on C++. 
Its title should tell you that. The book 
assumes that you already know the syn- 
tax of the C++ language, and its em- 
phasis is on programming idioms, which 
use the features of C++ to “express func- 
tionality outside the language proper, 
while giving the illusion of being part 
of the language.” This is extensibility to 
the extreme. C is extensible in that you 
can add functions that modify data val- 
ues and program flow. C++ is more ex- 
tensible in that you can add data types 
with their own behavior and represen- 
tations. The programming idioms that 
Coplien advances extend C++ further 
still, apparently changing the behavior 
of the language to solve problems not 
addressed in the language design. The 
book begins by discussing some tradi- 
tional idioms that even the newest C++ 
programmer will recognize—copy con- 
structors that solve the memory-man- 
agement problem and iostreams that use 
familiar overloaded insertion and extrac- 
tion operators, for example. But gradu- 
ally, the book gets into the more arcane, 





such as the counted pointer and enve- 
lope/letter classes discussed earlier. 

That you can do these tricks with C++ 
and that the practice gains credibility by 
endorsements such as this book are what 
give pause to many an outside observ- 
er. Some of this stuff is the bungee jump- 
ing of programming. Programming on 
steroids. If you do not like C and you 
do not want to like C++, this book will 
reinforce your bias. If you like C and 
C++, this book will reinforce those feel- 
ings, too. Congratulations, Jim, you’ve 
written a book that everyone will like. 

Coplien’s close association with C++ 
since its inception allows him to pro- 
vide personal insight into some of its 
history and how parts of the language 
evolved. Even if you think you already 
know C++ pretty well, ’d recommend 
this book. There is always more to learn, 
and the book is well written—a delight 
to read, in fact—and may very well be 
the most important programming book 
since K&R. 


DDJ 
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Development Conferences. This year 
more than 5,000 of the leaders in the 
development field came to Software 
Development Spring — that’s 100% more 
than last year! 

Why were they there? To catch up on 
the latest trends in programming — from 
object-oriented programming in C++ to 
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to their current programming dilemmas. 
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the software field. 
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ness. Software Development happens 
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At Software Development Fall the 
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Steve Mellor, Tom Plum, Richard Hale 
Shaw, Bruce Eckel, Larry Constantine, 
Andrew Shulman, and Tom Swan to name 
just a few. 

These aren’t just great speakers, they 
are great teachers who can provide pro- 
gramming solutions you can take back to 
the office and put to use immediately. 

Come to Software Development Fall 
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gized and ready to take on your next big 
challenge. 
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14-18, 1992 at the World Trade Center 
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Making Patents 
Work 


atch close, gang (and no gig- 
gling!). The boy is about to 
do something he doesn’t do 
often: change his mind in 
public. Furthermore, the subject in ques- 
tion is one I thought I’d fight to the 
death in opposition: software patents. I 
think about it a lot, because software 
patents could still mean the end of the 
American edge in software develop- 
ment, especially for the small develop- 
er. I’m glad, at times, that I make my 
living writing about programming rather 
than actually doing it, because it’s en- 
tirely possible that software patents 
could bring progress in programming 
technology to a screeching halt in an- 
other ten years. 
It doesn’t have to be that way. 
Software patents could actually work 
forthe small developer and not against 
him. The problem, in fact, is not with 
software patents at all, but with the 
patent system itself, as it is currently in- 
flicted upon both hardware and soft- 
ware. This notion has been burning a 
hole in my pocket, so forgive me if I 
step back from Turbo Vision for half a 
column and explain. 


What are Patents For? 
People who have defended software 
patents from the outset claim that pat- 


Jett Duntemann, KG7JF 


ents are absolutely necessary to ensure 
that an inventor’s investment in time 
and effort is rewarded and not simply 
appropriated by someone else. Right 
on, brother. Trouble is, patents don’t do 
that. They don’t even come close. Pat- 
ents currently ensure nothing except 
that the lawyers will get paid, and that 
the guy with the deepest pockets wins 
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the war 85 or 90 percent of the time. 

This is crazy. An occasional inventor 
wins the war against big infringers, like 
that chap who invented a certain kind 
of pliers and won big against Sears Roe- 
buck some years ago. We regularly hear 
about the rare cases like this, and almost 
never about the inventors who run out 
of money halfway through an infringe- 
ment case and simply give up, having 
not only lost their invention to infringe- 
ment but frequently all their savings and 
credit fighting that infringement. 

The current patent system is brutally 
stacked against individual inventors, 
who nonetheless petulantly defend it 
because they see it as all they’ve got. 
What they don’t see (remarkably, if they 
call themselves inventors) is that the 
patent system could well be improved 
several thousand percent, simply by re- 
focusing it on the jobs it is supposed to 
do: 1. Ensuring that research and in- 
vention is rewarded; 2. creating a busi- 
ness climate in which engineering art 
can progress briskly. 


Stealth Patents 

Currently, patents work diametrically 
against both of those stated aims. The 
biggest sin committed by our patent pro- 
cess against Job #1 is that patent appli- 
cations are kept secret until patents are 
granted. People can invest a great deal 
of time and money independently in- 
venting something, only to discover that 
someone else suddenly owns the idea, 
and everything they’ve done is lost, or 
else at the mercy of the new patent 
holder. 

Such “stealth patents” rob the inde- 
pendent inventor of his or her research 
investment, and I’ve never heard a good 
explanation of why this must be so. 
Patent applications should be published 
immediately, so that persons indepen- 





dently researching the same concept 
have a chance to give up the fight be- 
fore they lose their shirts, or else ad- 
vance the engineering art by innovating 
around the published application. Also, 
making patent applications public im- 
mediately would allow prior art to be 
discovered and put forth by people out- 
side the patent office before the patent 
is granted. Right now, prior art must be 
proven in court after a patent is grant- 
ed, and all court fights are frightfully ex- 
pensive, ultimately benefiting only our 
smug ruling class of trial lawyers. 


Locking Out the Little Guy 

The key problem with patents, howev- 
er, is that they’re used by large concerns 
to control markets and lock out small 
startup companies and individual in- 
ventors. IBM and other large companies 
frequently cross-license their patent port- 
folios at little or no cost to one anoth- 
er, in a sort of you-use-my-stuff and [’Il- 
use-your-stuff deal. However, the small 
firm holding one minor patent has lit- 
tle to offer the Big Guys and must ac- 
cept their royalty terms, usually a per- 
centage for each patent licensed. Having 
to pay cash to license a complicated 
web of patents can make a product eco- 
nomically impossible. The large com- 
panies can freely use one another’s 
patents as part of technology exchange 
licenses. The little guy is locked out. 
None of this should be surprising. Big 
companies like IBM and TI are awe- 
somely inefficient, and they fear noth- 
ing so much as the hungry, low-over- 
head technology partnership working 
miracles in their jeans in a crufty part 
of South Phoenix. 


Crazy Claims and Prior Art 


I heard a talk given to hopeful inven- 
tors by an “invention consultant” some 
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years ago, and it was an eye-opener. 
The most important thing about a pat- 
ent, according to the consultant, was to 
draft the claims portion of the patent 
such that innovating around the patent 
was impossible. The idea is to make 
them as broad as possible without caus- 
ing the examiners to reject the claims 
as unrealistic. In most cases, the chap 
went on, the patent office will let ex- 
tremely broad claims pass—and once 
you have the patent, you can take in- 
fringers to court with the legal edge on 
your side. 

If patent claims are intended to make 
further innovation impossible, they’re 
working directly against the core of the 
patent idea, which is to make innova- 
tion rewarding and encourage the ad- 
vance of engineering art. Standards for 
breadth of claims must be tightened, and 
existing patents with overly broad claims 
should be rescinded as fraudulent. 

Prior art has proven a serious prob- 
lem in software patents, in part because 
most software innovation has been pro- 
tected as trade secrets rather than pat- 
ents, and thus not published at all. But 
simple ignorance on the part of the 
patent office has allowed many absurd 
applications to obtain the legal strength 
of a granted patent. 


Paul Heckel’s ridiculous Zoomracks 
patent is the best example I’ve seen. 
Paul patented what amounts to a pic- 
ture drawn on a text screen of some- 
thing I used many years ago as a clerk 
at Xerox: the Ring-King Visible Records 
Rack, a steel plate covered with plastic 
card-holders that overlapped such that 
only the bottom quarter inch of each 
card was exposed. You could scan the 
rack for the name of the customer, then 
flip the card up and read the rest of the 
data. Zoomracks works pretty much this 
way. There’s some additional gob- 
bledegook about compressing data by 
yanking out vowels that looks a great 
deal like 1950s secretarial shorthand. 

Should drawing a (marginal) screen 
picture of some device commonly used 
in the outside world be patentable? Is 
shorthand patentable? 

Don't be silly. Yet the patent office 
let it pass. 


So What's Obvious? 

When our current patent system was 
designed, virtually all inventions were 
mechanical in nature, and relatively sim- 
ple at that. Any reasonably educated 
person could tell if an invention was 
“obvious” or not. Well, technology has 
unfolded in countless ways, and the na- 


EVALUATOR 


THE NON-INTRUSIVE 
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Software testing used to be a tedious and time consuming task...but not any 


longer. 


Quality conscious software developers are now using the Evaluator 
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The Evaluator allows software to be tested unattended 24 hours a day, 7 days 
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tested. Mini and Mainframe software can also be tested. 


Since the Evaluator is hardware based and no code is loaded on the Target 
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based application. Some features of the new GUI Toolkit: ICON/PATTERN 
Recognition, X, Y Mouse Control, lightening fast synchronized testing. 


Testers and Developers "trust" the Evaluator. 


ture of obviousness is now something 
known best to regular practitioners in 
the field. This is doubly true today of 
software patents, which is an area that 
has not, until very recently, been sub- 
ject to patents, and which most patent 
examiners know very little about. 

What needs to be done is to convene 
volunteer examiner panels in each of 
numerous technological specialties, in- 
cluding software, and have the panels 
rule on whether applications are at- 
tempting to patent the obvious. Let the 
panels’ decision be final—or at best, al- 
low one appeal to a second panel con- 
vened of different individuals from the 
same field. 


The Big Win 

But above all else, bar nothing, the re- 
form that could make our patent sys- 
tem realize its stated goals is to remove 
patent-holders’ ability to control mar- 
kets. Right now, if you hold a patent, 
you can offer it to some licensees but 
not others, charge different rates to dif- 
ferent licensees, and all manner of mar- 
ket-manipulation mischief like that. My 
reform proposal is to create a system of 
competing rights collectives that would 
represent inventors and collect royalties 
on patents for them, taking a small fee. 


To arrange a demonstration or obtain more information please contact 
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Quick C for Windows, Borland C++, Turbo Pascal for 
Windows, Visual Basic, and any language supporting 


DLL's. 
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STRUCTURED PROGRAMMING 


(continued from page 142) 

Inventors would be allowed to negoti- 
ate other licenses, but they would be 
required by law to make their inven- 
tions available for a standard, regulated 
maximum fee to anyone who wishes to 
license them. 

The music business works a lot like 
this right now. You don’t have to indi- 
vidually negotiate the right to perform 
a song in a bar with each songwriter. 
Instead, you buy a license from ASCAP, 


ASCAP has a huge 
war chest for 
prosecuting 

infringers, which 
they do relentlessly 
and remorselessly, as 
many small bar 
owners will surely 
attest 





which distributes the royalties to par- 
ticipating songwriters. This works well, 
as judged by the enthusiasm of the 
songwriters for the system. 

ASCAP has a huge war chest for pros- 
ecuting infringers, which they do re- 
lentlessly and remorselessly, as many 
small bar owners will surely attest. I 
think it’s a great system. I think it would 
work phenomenally well for patents. 

I envision it happening this way: Some 
portion of a product’s net receipts would 
be earmarked for patent royalties. I think 
15 percent would be about right, since 
(having done a little product developing 
myself) I think that no more than 15 per- 
cent of the manufacturer’s portion of the 
value of any given product is pure in- 
novation. Much of it is marketing, doc- 
umentation, and simple implementation 
of public-domain art. 

A concern wishing to license patents 
would register a product with one or 
more rights collectives, specifying which 
patents it is licensing. Those rights col- 
lectives would split the 15 percent equal- 
ly, and be constrained from quibbling 
over which patent contributed “more” 
to the product’s ultimate value. Such 
questions only enrich lawyers and do 
nothing for either inventors or the en- 
gineering art. A valid, paid-up license 
with a collective would protect the li- 
censee from any patent litigation of any 


kind from that collective. 

With the collectives working for them, 
doing the legal wrangling and crunch- 
ing the paperwork, inventors could do 
what they do best. Better still, other in- 
ventors could build on the work of their 
fellows, and those with the most- 
licensed patents would get the most 
money. Inventors would be rewarded, 
and the progress of technology would 
go absolutely through the roof. 

I would support this system. I would 
holler from the heights in favor of this 
system. I would even support nonobvi- 
ous software patents under this system. 
I can foresee that IBM, TI, and Apple 
would oppose it, as would (of course) 
the lawyers’ bloc. 

Could it ever happen? Who knows? 
The USSR is history—weirder and 
tougher things have happened. We’ve 
got a long way to go. And knowing 
what I know of lawyers, I’m still 
damned glad I’m a writer and not a pro- 
grammer. 


A Mortgage-ing We Go 

Enough of that. We’re riding the whirl- 
pool here, trying to make sense of Tur- 
bo Vision streams. I provided an over- 
view of how streams work last month. 
This time, we’ll start having a look at a 
practical example, a revision of HCALC 
PAS that has the ability to write mort- 
gage tables to a stream and read them 
back again. 

My first job in adding streams to 
HCALC was, in fact, adding streams to 
the mortgage object itself. The revised 
MORTGAGE.PAS is given in Listing One 
(page 164). HCALC is a serious chunk 
of code and will have to wait until next 
month; there’s plenty to talk about in 
the meantime. 

I had to make three general mods to 
MORTGAGE.PAS: 


e | added the 7Mortgage type to the Tur- 
bo Vision object hierarchy by making it 
a child of TObject. I explained the rea- 
sons for this in detail last month: The 
very first field in any streamable object 
must be a pointer to that object’s VMT. 
Since TObject is itself the child of no 
object and has a VMT pointer as its first 
field, all objects that descend from 7- 
Object will have the requisite VMT point- 
er as their first field. 

e I created a stream-registration record 
for TMortgage. It’s called RMortgage, and 
it exists primarily to let the Turbo Pas- 
cal RTL know where a given object 
type’s Store and Load methods exist in 
the code segment. Consider it a record 
in a behind-the-scenes index file that 
the RTL keeps of all its VMTs and stream 
access methods. 

e | added a Load and a Store method 
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(continued from page 144) 

to the TMortgage object. The Load 
method must be a constructor, because 
what it does is very similar to what the 
quintessential constructor /nit does: It 
builds an object on the heap. To do this, 
it uses information it reads from the 
stream. Jnit, by contrast, builds an ob- 
ject on the heap from information hard- 
coded into the constructor itself. 


Store, on the other hand, is an ordi- 
nary method. 


Looking Closely at TMorfgage. Store 
Storing an object out to a stream is rel- 
atively easy. All the data is right there, 
intact and accessible. As I show in the 
TMortgage.Store method, you simply 
blast out an object’s fields one at a time, 
using the Write method belonging to 
TStream. Make sure you specify the 
stream when you call Write! That is, 
make sure the call is S. Write Gif the name 
of your stream is S) rather than just 
Write. 

One caution: If the object for which 
you're writing a Store method has a par- 
ent object with a Store method (TMort- 
gage does not, because its parent, 7- 
Object, has no Store method), you must 
call the parent’s Store method before 


From Abraxas Software, Inc.... 


beginning to send your own object’s 
fields out to the stream. You'll see how 
this works next month, in the Store 
methods belonging to HCALC’s 7Mort- 
gage View objects. 

The Write method needs the name of 
the field, and the number of bytes of 
data to be stored from that field to the 
stream. The best way to do this is to use 
the built-in SizeOf function on the field’s 
type specifier, like so: 


S.Write(Principal,SizeOf(Real)); 


Since type Real is six bytes long, this 
statement writes six bytes of data con- 
taining the field Principal out to stream 
instance S. 

Only the last Write is vaguely tricky. 
If you remember from earlier discus- 
sions of the T7Mortgage type, the mort- 
gage amortization table itself is a dyna- 
mically sized array on the heap. This 
allows you to have a 15-year, 30-year, 
or 247-month mortgage if you want, and 
not waste any heap space. The Pay- 
mentSize field is a long integer con- 
taining the total size in bytes of the Pay- 
ments\ dynamic array. PaymentSize, 
passed as a parameter to S. Write, allows 
you to write only the exact amount of 
data to the stream to embrace the full 


length of the amortization table—and 
no more. 

You'll note that although TMortgage 
has a pointer field named Payments, 
that pointer field is not written to the 
stream. Pointers are 32-bit addresses of 
memory locations on the heap. Writing 
them to disk is kind of pointless, be- 
cause there’s no promise that when you 
bring a pointer back from disk, it’s go- 
ing to point anywhere meaningful. We 
use the Payments pointer to help get 
the amortization table out to the stream, 
but that done, we no longer need Pay- 
ments in the stream-writing process. 


Getting Things Back from the Stream 
What gets written out to the stream with 
Store gets read back from the stream 
with Load. The Load method is con- 
ceptually similar to Store: You use the 
stream’s Read method to bring fields in 
from the stream, one by one, in the 
same order that they were written out 
with Store. Again, you must tell the Read 
method how many bytes of data to 
bring in from the stream with a second 
parameter. 

TMortgage.Load doesn’t attempt to 
read a value for Payments from the 
stream. We didn’t write out anything for 
Payments to begin with. Instead, we al- 
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locate (with GetMem) just enough space 
on the heap to contain the table and 
store the address of that block of heap- 
space into Payments. We had previously 
read PaymentSize from the stream, con- 
taining the correct size of the amortiza- 
tion table. With Payments pointing to a 
correctly sized block of heapspace, we 
can load the amortization table onto the 
heap directly, with S.Read: 


S.Read(Payments/,Paymentsize); 


That’s all it takes to get the mortgage 
object itself to and from the stream. It 
may seem obvious, but for complicated 
objects with loads of fields, be careful 
to read fields back from a stream in the 
same order that they were written out. 


More from the Confusion File 

Which isn’t to say that figuring it all out 
was easy. On page 157 of the Turbo Vi- 
sion Guide, it says, “Turbo Vision reg- 
isters all the standard objects, so you 
don’t have to.” That makes sense. Too 
bad that on page 163 it says, “The rule 
is simple and unforgiving: It’s your 
responsibility to register every object 
type that your program will put onto a 
stream.” Just my own? Or the standard 
ones too? 

The answer (in case you were won- 
dering, and if you’ve tried to do any- 
thing at all with streams then you’ve 
probably been wondering until your 
nose bled) is that page 163 has it right: 
Turbo Vision registers nothing for you. 
It defines registration records for all 
standard types (which is what I think 
the marginal note on page 157 was try- 
ing to say) but you have to call the 
RegisterType procedure for each stan- 
dard type you intend to put out to a 
stream. 

Note that this does vot mean that you 
have to register the object types that you 
inherit from. 7TMortgageView inherits 
from TWindow, but I don’t have to reg- 
ister TWindow—just the exact object 
types that must be streamed. The prob- 
lem is that TWindow is a group, with a 
7Frame object attached to it through a 
pointer. TWindow’s own Lodd and Store 
methods know how to deal with that 
frame object, so that you never have to 
bother worrying about writing the 
TFrame to the stream. However, you 
must still register TFrame yourself. 

I got seriously messed over on this 
one. While trying to put a 7Mortgage- 
View object out to a stream, I kept get- 
ting an I/O error code back from the 
stream. The code showed up as —6, 
which equates to the stPutError con- 
stant, indicating (see page 371 of the 
Turbo Vision Guide for the full st-series 
stream error-code listing) that I was try- 
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ing to put an unregistered type onto the 
stream. I tried to register TView, TGroup, 
TWindow, and several other things be- 
fore I remembered that every TWindow 
object comes with its own 7Frame. 
Gakkh. 


Do it Once and Then Stop! 

All the standard types do, however, have 
predefined registration records, using a 
standard naming convention: Replace 
the T at the beginning of the standard 
type name with the letter R. Thus, the 
registration record for TFrame is 
kFrame. All of the standard types pro- 
vided by Borland have object ID codes 
under 100, so any number over 100 is 
fair game. 

Trying to register a type a second time 
will cause runtime error 212. This is a 
catchall error message that will trigger 
if any of several things go wrong while 
registering a type, but the two causes 
to watch out for are registering a type 
with the same ID code as an already- 
registered type, and registering an al- 
ready-registered type. From TV’s per- 
spective, those two errors are identical, 
because it’s the unique ID code in a reg- 
istration record that defines a registra- 
tion record as unique, vot the name you 
give the record. 

From your perspective, however, the 
first cause is generally choosing an ID 
code that some other type already us- 
es, and the second is registering types 
in two or more different places in your 
application. To avoid confusion, gather 
all your registration calls into a single 
procedure. And take pains to note the 
registration ID codes of any third-party 
objects you incorporate into your ap- 
plications. And if you yourself provide 
streamable objects to other program- 
mers, complete with registration records, 
be sure to make it plain in comment 
headers or documentation what those 
ID codes are. 


Closing in on It 

That’s my word budget for this session. 
We're actually closing in on the target, 
and it shouldn’t take more than two 
more columns to peg streams reason- 
ably well. Next month I'll provide the 
updated HCALC.PAS listing, and we'll 
speak of peer view pointers and other 
irritations. Month after that, well, it might 
well be time to evaluate Turbo Vision 
on a cost-benefit basis, and look around 
at other ways to skin the same cat. 


DDJ 
(Listing begins on page 164.) 
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C CODE FOR THE PC 


source code, of course 


DBAPrep (embedded SQL to C translator; supports Oracle, Sybase, SQL Server, XDB, Novell XQL, SQLBase, and QE-Lib) ...... $1,500 
Embedded DOS (full-features, real-time, multitasking, 3.31-compatible DOS for embedded system and self-bootable installations) . . . . . $375 
Style (a C+ + class library to build, maintain and traverse arbitrary, non-taxonomic links between C++ objects) . . 2... ......028. $300 
a ime Seal checker for incorporation into text products; no royalty; large dictionary; small and 2 io ee oe ee ee eS $300 
IP Image Processor & Victor Image Library Version 2.0 (brightness, contrast, merge images, TIFF/GIF/PCX/bin, HP ScanJet support) $290 
The Snooper (Ethernet protocol analyzer for Novell NetWare and LAN Manager Networks; capture packets; real-time display) ...... $275 
Report Ease (report writer and mail merge engine, include forms and produce reports from your product; noroyalties) .......... $275 
Turbo TeX (Release 3.0) HF, PS, dot drivers; CM fonts; LaljgX; Metabont) « <9. 6 & se ww he HO Rw ee a ew $250 
Se Wave tools.h+--+ or math.h-+--+ Class Library (extensive docs) . . 2. 2 6 es ee ee ee te each $240 
InControl Toolbox(forms package for Windows; validation functions, date/time, regular expression formatted text control). . ....... $210 
PxSQL (SQL for Borland’s Paradox Engine; ANSI X3.135-1989 SQL-DML standard; network server not required). ........... $180 
c_pslib (PostScript generation library for C programs; includes complete graphics, font, rotation & paragraph support) .......... $170 
C/C+ + Libraries by Code Farms (persistent C structures, ER models, dynamic arrays, disk pager, database functions,much more)... . . $170 
ViewPoint (C++ graphics library; 32-bit color, pattern fill, scroll & bitmap, coordinate ee (shee ee eC eee eee ee $170 
Minix Operating System (Version 1.5; Unix-like operating system, includes manual; specify 5.25” or 3.5” diskettes)... 2... ...... $150 
ViewThieve (relational view of Novell Btrieve databases; includes EZTrieve) . 2... 2. 2. ee $150 
Delorie GCC for MS-DOS (Version 1.05; includes C+ +, assembler, DOS extender, 387 emulation; complete source code and makefiles) . . $150 
Booter Toolkit (floppy disk bootstrap routines, DOS file system, light-weight multitasking, windows, fast memory management) ..... . $120 
Dr. MD (runtime memory analyzer & debugger, find many memory corruption errors; examine memory usage) . ............ $110 
Visions 1.20 (text window user interface management pena includes mouse support and background processing) ............ $105 
SCM (portable Scheme in C, conforms to IEEE and 3.99 specs, garbage collection, mixs with C; SCM3C12/SLIB1B3). ........2.. $100 
PC/IP (CMU/MIT TCP/IP for PCs; Crynwr drivers, NFS server, Bdale mailer, PCRoute/PCBridge, NDIS/ODI drivers, Beholder, more) . . . $100 
Demacs (complete GNU Emacs for DOS; needs djgcc to build; basedon 18.55)... 2. 1. ee ee ee $100 
BASH (key-indexed record management system for DOS and Windows DDL; record locking for concurrent access environments) .... . $95 
Script Interpreter (a command script interpreter for DOS-based systems; C-like script language; lots of features)... ........4.. $90 
tlorC ++. (C++ Class Library for Borland’s Paradox Engine)... «6.34 4 «4 © «8 @ 6 6G * FS Sew wR RS BE $80 
VMEM (virtual memory manager by Blake McBride, Version 3.6, LRU pager, dynamic swap file, image save/restore). .......... $80 
CPPCOMM (Version 2.0; C++ class library for serialcommunications) . . 2... 1 ee ee $75 
eval() (C functions to parse & repeatedly evaluate arithmetic expressions with variables and built-infunctions). . ............ $75 
TreeDraw (PostScript display of labeled hierarchical trees; DOS & Windows screen display included; Moenalgorithm) .......... $75 
BGI Printer Driver Toolkit (hardcopy drivers for Borland BGI graphics library; Epson, IBM, LaserJet, DeskJet, PaintJet, PostScript) . . . . $75 
FinanC (large collection of financial function including bond, inventory, stock portfolio, & cashflow)... . 2... 2... ee ee ee $70 
FlexList (doubly-linked lists of arbitrary data with multiple access methods; specifyCorC++) .. 1... 2... 0.2.05. 02. 000004 $65 
s3d (3-dimensional perspective line drawings with PostScript output) . 2. 6 6 6 we ee $60 
DB (Loose Data Binder; persistent data objects for C++; handles pointers between objects) . 2... 2... 2. eee ee ee $60 
Kier DateLib (all kinds of date manipulation; translation, validation, formatting, & arithmetic) ............2.-505020 208 | $60 
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Color Modeling in 
256-color Mode 


ately, my daughter has wanted 

some fairly sophisticated books 

read to her. Wind in the Willows. 

Little House on the Prairie. Pretty 
heady stuff for a six year old, and some- 
times I wonder how much of it she re- 
ally understands. As an experiment, dur- 
ing today’s reading I stopped whenever 
I came to a word I thought she might 
not know, and asked her what it meant. 
One such word was “mulling.” 

“Do you know what ‘mulling’ means?” 
I asked. 

She thought about it for a while, then 
said, “Pondering.” 

“Very good!” I said, more than a little 
surprised. 

She smiled and said, “But, Dad, how 
do you know that I know what ‘pon- 
dering’ means?” 

“Okay,” I said, “What does ‘ponder- 
ing’ mean?” 

“Mulling,” she said. 

What does this anecdote tell us about 
the universe in which we live? Well, it 
certainly indicates that this universe is 
inhabited by at least one comedian and 
one good straight man. Beyond that, 
though, it can be construed as a para- 
ble about the difficulty of defining things 
properly; for example, consider the 
complications inherent in the definition 
of color on a 256-color display adapter 


Michael Abrash 


such as the VGA. Coincidentally, VGA 
color modeling just happens to be this 
month’s topic, and the place to start is 
with color modeling in general. 


A Color Model 

We've been developing X-Sharp, a real- 
time 3-D animation package, for sever- 
al months now. Last month, we added 
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illumination sources and shading; that 
addition makes it necessary for us to 
have a general-purpose color model, so 
that we can display the gradations of 
color intensity necessary to render illu- 
minated surfaces properly. In other 
words, when a bright light is shining 
straight at a green surface, we need to 
be able to display bright green, and as 
that light dims or tilts to strike the sur- 
face at a shallower angle, we need to 
be able to display progressively dimmer 
shades of green. 

The first thing to do is to select a col- 
or model in which to perform our shad- 
ing calculations, the dot product-based 
stuff I discussed last month. The ap- 
proach we'll take is to select an ideal rep- 
resentation of the full color space and 
do our calculations there, as if we real- 
ly could display every possible color; on- 
ly as a final step will we map each de- 
sired color into the limited 256-color set 
of the VGA, or the color range of what- 
ever adapter we happen to be working 
with. There are a number of color mod- 
els that we might choose to work with, 
but I’m going to go with the one that’s 
both most familiar and, in my opinion, 
simplest: RGB (red, green, blue). In the 
RGB model, a given color is modeled as 
the mix of specific fractions of full in- 
tensities of each of the three color pri- 
maries. For example, the brightest pos- 
sible pure blue is 0.0*R, 0.0*G, 1.0*B. 
Half-bright cyan is 0.0*R, 0.5*G, 0.5*B. 
Quarter-bright gray is 0.25*R, 0.25*G, 
0.25*B. You can think of RGB color 
space as being a cube, as shown in Fig- 
ure 1, with any particular color lying 
somewhere inside or on the cube. 

RGB is good for modeling colors gen- 
erated by light sources, because red, 
green, and blue are the additive pri- 
maries; that is, all other colors can be 
generated by mixing red, green, and 





blue light sources. They’re also the pri- 
maries for color computer displays, and 
the RGB model maps beautifully onto 
the display capabilities of 15- and 24- 
bpp display adapters, which tend to rep- 
resent pixels as RGB combinations in 
display memory. 

How, then, are RGB colors repre- 
sented in X-Sharp? Each color is repre- 
sented as an RGB triplet, with eight bits 
each of red, green, and blue resolution, 
using the structure shown in Listing One 
(page 106). That is, each color is de- 
scribed by three color components— 
one each for red, green, and blue—and 
each primary color component is rep- 
resented by eight bits. Zero intensity of 
a color component is represented by 
the value 0, and full intensity is repre- 
sented by the value 255. This gives us 
256 levels of each primary color com- 
ponent, and a total of 16,772,216 pos- 
sible colors. 

Holy cow! Isn’t 16,000,000-plus col- 
ors a bit of overkill? 

Actually, no, it isn’t. At the eighth 
Annual Computer Graphics Show in 
New York, this past January, Sheldon 








Figure 1; The RGB color cube. 
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Linker, of Linker Systems, related an in- 
teresting tale about color perception re- 
search at the Jet Propulsion Lab back in 
the 70s. The JPL color research folks 
had the capability to print more than 
50,000,000 distinct and very precise col- 
ors on paper. As a test, they tried print- 
ing out words in various colors, with 
each word printed on a background that 
differed by only one color index from 
the word’s color. No one expected the 
human eye to be able to differentiate 
between two colors, out of 50,000,000- 
plus, that were so similar. It turned out, 
though, that everyone could read the 
words with no trouble at all; the human 


Tw 


Debug OS/2 2.0, 
Windows 3.1 
Drivers 


Save up to $500 
on Model IV 
Hardware 


eye is surprisingly sensitive to color gra- 
dations, and also happens to be won- 
derful at detecting edges. 

When the JPL team went to test the 
eye’s sensitivity to color on the screen, 
they found that only about 16,000,000 
colors could be distinguished, because 
the color-sensing mechanism of the hu- 
man eye is more compatible with re- 
flective sources such as paper and ink 
than with emissive sources such as 
CRTs. Still, the human eye can distin- 
euish about 16,000,000 colors on the 
screen. That’s not so hard to believe, if 
you think about it; the eye senses each 
primary color separately, so we’re real- 
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ly only talking about detecting 256 lev- 
els of intensity per primary here. It’s the 
brain that does the amazing part; the 
16,000,000-plus color capability actual- 
ly comes not from extraordinary sensi- 
tivity in the eye, but rather from the 
brain's ability to distinguish between all 
the mixes of 256 levels of each of three 
primaries. 

So it’s perfectly reasonable to main- 
tain 24 bits of color resolution, and X- 
Sharp represents colors internally as 
ideal, device-independent 24-bit RGB 
triplets. All shading calculations are per- 
formed on these triplets, with 24-bit col- 
or precision. It’s only after the final 24- 
bit RGB drawing color is calculated that 
the display adapter’s color capabilities 
come into play, as the X-Sharp function 
ModelColorToColorIndex() is called to 
map the desired RGB color to the clos- 
est match the adapter is capable of dis- 
playing. Of course, that mapping is 
adapter dependent. On a 24-bpp de- 
vice, it’s pretty obvious how the inter- 
nal RGB color format maps to displayed 
pixel colors: directly. On VGAs with 15- 
bpp Sierra Hicolor DACs, the mapping 
is equally simple, with the five upper 
bits of each color component mapping 
straight to display pixels. But how on 
earth do we map those 16,000,000-plus 
RGB colors into the 256-color space of 
a standard VGA? 

This is the “color definition” problem 
I mentioned at the start of the column. 
The VGA palette is arbitrarily pro- 
erammable to any set of 256 colors, with 
each color defined by six bits each of 
red, green, and blue intensity. In X- 
Sharp, the function /nitializePaletteV 
can be customized to set up the palette 
however we wish; this gives us nearly 
complete flexibility in defining the work- 
ing color set. Even with infinite flexi- 
bility, however, 256 out of 16,000,000 
or so possible colors is a pretty puny 
selection. It's easy to set up the palette 
to give yourself a good selection of just 
blue intensities, or of just greens; but 
for general color modeling there’s sim- 
ply not enough palette to go around. 

One way to deal with the limited si- 
multaneous color capabilities of the VGA 
is to build an application that uses on- 
ly a subset of RGB space, then bias the 
VGA’s palette toward that subspace. This 
is the approach used in the DEMO1 sam- 
ple program in X-Sharp; Listings Two 
and Three (page 166) show the versions 
of InitializePalette.) and ModelColor- 
ToColorIndex() that set up and perform 
the color mapping for DEMO1. In DE- 
MO1, three-quarters of the palette is set 
up with 64 intensity levels of each of the 
three pure primary colors (red, green, 
and blue), and then most drawing is 
done with only pure primary colors. The 
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(continued from page 150) 

resulting rendering quality is very good, 
because there are so many levels of each 
primary. 

The downside is that this excellent 
quality is available for only three col- 
ors: red, green, and blue. What about 
all the other colors that are mixes of the 
primaries, like, say, cyan or yellow, to 
say nothing of gray? In the DEMO1 col- 
or model, any RGB color that is not a 
pure primary is mapped into a 2-2-2 
RGB space that the remaining quarter 
of the VGA’s palette is set up to display; 
that is, there are exactly two bits of pre- 
cision for each color component, or 64 
general RGB colors in all. This is gen- 
uinely lousy color resolution, being on- 
ly 1/64th of the resolution we really 
need for each color component. In this 
model, a staggering 262,144 colors from 
the 24-bit RGB cube map to each col- 
or in the 2-2-2 VGA palette. The results 


Character drawn in foreground color 


Character cell (background box) 
drawn in background color 


Figure 2: Solid text. 
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are not impressive; the colors of mixed- 
primary surfaces jump abruptly, badly 
damaging the illusion of real illumina- 
tion. To see how poor a 2-2-2 RGB se- 
lection can look, run DEMO1, and press 
the ‘2’ key to turn on spotlight 2, the 
blue spotlight. Because the ambient 
lighting is green, turning on the blue 
spotlight causes mixed-primary colors 
to be displayed—and the result looks 
terrible, because there just isn’t enough 
color resolution. Unfortunately, 2-2-2 
RGB is close to the best general color 
resolution the VGA can display. 
Another approach would be to set up 
the palette with reasonably good mixes 
of two primaries but no mixes of three 
primaries, then use only two-primary 
colors in your applications (no grays or 
whites or other three-primary mixes). Or 
you could choose to shade only select- 
ed objects, using part of the palette for 
a good range of the colors of those ob- 
jects, and reserving the rest of the palette 
for the fixed colors of the other, non- 
shaded objects. Jim Kent, author of Au- 
todesk Animator, suggests dynamically 
adjusting the palette to the needs of each 
frame, for example by allocating the col- 
ors for each frame on a first-come, first- 
served basis. That wouldn’t be trivial to 
do in real time, but it would make for 
extremely efficient use of the palette. 
The sad truth is that the VGA’s 256- 
color palette is an inadequate resource 
for general RGB shading. The good 
news is that clever workarounds can 
make VGA graphics look nearly as 
good as 24-bpp graphics; the burden 
falls on you, the programmer, to de- 
sign your applications and color map- 
ping to compensate for the VGA’s lim- 
itations. To experiment with a different 
256-color model in X-Sharp, just change 
InitializePalette() to set up the desired 
palette and ModelColorToColorIndex() 
to map 24-bit RGB triplets into the 
palette you’ve set up. It’s that simple, 
and the results can be striking indeed. 


Where to Get X-Sharp 

The full source for X-Sharp is available 
in the file XSHP72.ARC in the DDJ Fo- 
rum on CompuServe, and as XSHARP7 
.ZIP in both the programming/graphics 
conference on M&T Online and the 
graphic.disp conference on Bix. Alter- 
natively, you can send me a 360K or 
720K formatted diskette and an ad- 
dressed, stamped diskette mailer, care 
of X-Sharp, DDJ, 411 Borel Ave., San 
Mateo, CA 94402, and I'll send you the 
latest copy of X-Sharp. There’s no 
charge, but it’d be very much appreci- 
ated if you’d slip in a dollar or so to 
help out the folks at the Vermont As- 
sociation for the Blind and Visually Im- 
paired. 


I'm available on a daily basis to dis- 
cuss X-Sharp on M&T Online and Bix 
(user name mabrash in both cases). 


Fast VGA Text 

This next item comes from The BitMan. 
(That’s how he asked to be described: 
don’t ask me why.) The BitMan passed 
along a nifty application of the VGA’s 
under-appreciated write mode 3 that is, 
under the proper circumstances, the 
fastest possible way to draw text in any 
16-color VGA mode. 

The task at hand is illustrated by Fig- 
ure 2. We want to draw what’s known 
as solid text, in which the effect is the 
same as if the cell around each charac- 
ter was drawn in the background col- 
or, and then each character was drawn 
on top of the background box. (This is 
in contrast to transparent text, where 
each character is drawn in the fore- 
ground color without disturbing the 
background.) Assume that each char- 
acter fits in an eight-wide cell (as is the 
case with the standard VGA fonts), and 
that we’re drawing text at byte-aligned 
locations in display memory. 

Solid text is useful for drawing menus, 
text areas, and the like; basically, it can 
be used whenever you want to display 
text on a solid-color background. The 
obvious way to implement solid text is 
to fill the rectangle representing the 
background box, then draw transparent 
text on top of the background box. 
However, there are two problems with 
doing solid text this way. First, there’s 
some flicker, because for a little while 
the box is there but the text hasn’t yet 
arrived. More important is that the back- 
ground-followed-by-foreground ap- 
proach accesses display memory three 
times for each byte of font data: once 
to draw the background box, once to 
read display memory to load the latch- 
es, and once to actually draw the font 
pattern. Display memory is incredibly 
slow, so we'd like to reduce the num- 
ber of accesses as much as possible. 
With The BitMan’s approach, we can 
reduce the number of accesses to just 
one per font byte, and eliminate flick- 
er, too. 

The keys to fast solid text are the 
latches and write mode 3. The latches, 
as you may recall from earlier discus- 
sions in this column, are four internal 
VGA registers that hold the last bytes 
read from the VGA’s four planes; every 
read from VGA memory loads the latch- 
es with the values stored at that display 
memory address across the four planes. 
Whenever a write is performed to VGA 
memory, the latches can provide some, 
none, or all of the bits written to mem- 
ory, depending on the bit mask, which 
selects between the latched data and 
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GRAPHICS PROGRAMMING 


(continued from page 152) 

the drawing data on a bit-by-bit basis. 
The latches solve half our problem; we 
can fill the latches with the background 
color, then use them to draw the back- 
ground box. The trick now is drawing 
the text pixels in the foreground color 
at the same time. 

This is where it gets a litthe compli- 
cated. In write mode 3 (which inciden- 
tally is not available on the EGA), each 
byte value that the CPU writes to the 
VGA does not get written to display 
memory. Instead, it turns into the bit 
mask. (Actually, its ANDed with the Bit 
Mask register, and the result becomes 
the bit mask, but we’ll leave the Bit 
Mask register set to OXFF, so the CPU 
value will become the bit mask.) The 
bit mask selects, on a bit-by-bit basis, 
between the data in the latches for each 
plane (the previously loaded back- 
ground color, in this case) and the fore- 
ground color. Where does the fore- 
eround color come from, if not from the 
CPU? From the Set/Reset register, as 
shown in Figure 3. Thus, each byte writ- 
ten by the CPU (font data, presumably) 
selects foreground or background col- 
or for each of eight pixels, all done with 
a single write to display memory. 

I know this sounds pretty esoteric, 
but think of it this way. The latches hold 
the background color in a form suitable 
for writing eight background pixels (one 





Byte written to VGA 
m 


emory by CPU 


AND bit-mask register and CPU data 


full byte) at a pop. Write mode 3 allows 
each CPU byte to punch holes in the 
background color provided by the latch- 
es, holes through which the foreground 
color from the Set/Reset register can 
flow. The result is that a single write 
draws exactly the combination of fore- 
ground and background pixels de- 
scribed by each font byte written by the 
CPU. It may help to look at Listing Four 
(page 167), which shows The BitMan’s 
technique in action. And yes, this tech- 
nique is absolutely worth the trouble; 
it's about three times faster than the fill- 
then-draw approach described above, 
and about twice as fast as transparent 
text. So far as I know, there is no faster 
way to draw text on a VGA. 

It’s important to note that The Bit- 
Man’s technique only works on full 
bytes of display memory. There’s no 
way to clip to finer precision, the back- 
ground color will inevitably flood all of 
the eight destination pixels that aren't 
selected as foreground pixels. This 
makes The BitMan’s technique most suit- 
able for monospaced fonts with char- 
acters that are multiples of eight pixels 
in width, and for drawing to byte- 
aligned addresses; the technique can be 
used in other situations, but is consid- 
erably more difficult to apply. 

At this point, some of you are no 
doubt nodding your heads and saying, 
“Yes, I see how that would work.” Oth- 


Set/reset register 


ag oe 
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Selects latch bit 
where bit-mask bit is 
0; set/reset bit where 
bit-mask bit is 1. 
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Figure 3: Data path in write mode 3. 
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ers are probably muttering, “Well, heck, 
I knew that; tell me something new.” 
Then there are the rest of you, the VGA 
neophytes, the ones with glazed eyes, 
who think this technique sounds inter- 
esting, but understand maybe 30 per- 
cent of what you just read. Where can 
you turn for help? 
Read on. 


Useful VGA Reading 

For years, ’ve recommended Richard 
Wilton’s Programmer's Guide to PC and 
PS/2 Video Systems (Microsoft Press, 
1987, $24.95, ISBN 1-55615-103-9) as a 
VGA-programming reference, not be- 
cause it’s perfect, but because it was the 
only VGA book I knew of that was 
good enough to be useful. I’ve added 
another book to my good-enough-to- 
get list: Programmer's Guide to the EGA 
and VGA Cards, Second Edition, by 
Richard Ferraro (Addison-Wesley, 1990, 
$29.95, ISBN 0-201-57025-4). This 1000- 
plus page tome has a wide variety of 
valuable VGA information, ranging from 
registers to BIOS functions to the spe- 
cifics of seven manufacturers’ Super- 
VGA implementations, and it has plen- 
ty of good figures. This is, without 
question, a useful book. However, it is 
not (sigh), the ultimate VGA reference 
I’ve awaited for five years. The book is 
not error-free, especially regrettable in 
a second edition, for example, the po- 
larity of the bits in the Color Don’t Care 
register are reversed in the discussion 
of read mode 1. Also, although write 
mode 3 and the latches are covered, 
they’re discussed in considerably less 
detail than I’d like to see; you’d have a 
tough time figuring out why The Bit- 
Man’s technique works from this book 
alone. And surely Ferraro knows that 
you have to read display memory to 
load the latches before the bit mask can 
do its job of protecting selected pixels 
within a destination byte, because he 
shows line-drawing code that does just 
that. Still, he keeps saying that the bit 
mask keeps destination pixels from be- 
ing modified, as if that happens even if 
you don’t read display memory first. 
Nonetheless, I’ve found this book use- 
ful when I’ve reached for it, and I’ve 
found myself reaching for it increasing- 
ly often, and that’s the real test of any 
reference. If you’re a PC graphics pro- 
grammer, you should probably have this 
book on your shelf. 


DDJ 
(Listings begin on page 166.) 
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Literate 
Programming 


n much the same way that General 
Francisco Franco is still dead, vol- 
ume 4 (Combinatorial Algorithms) 
of Donald Knuth’s projected seven- 
volume Art of Computer Programming 
is still not out. Neither are volumes 5 
(Syntactical Algorithms), 6 (Theory of 
Languages), or 7 (Compilers). 

DDJ readers are probably familiar with 
the reason why: Knuth has been on a 
ten-year detour from the A7t of Com- 
puter Programming, working in the field 
of computer typesetting (TeX) and ty- 
pography (METAFONT). In addition to 
producing the TeX and METAFONT soft- 
ware itself, Knuth has used this software 
to produce an attractive five-volume se- 
ries, Computers and Typesetting, that in- 
cludes not only the documentation for 
TeX and METAFONT, but also their 
source code. 

What DDJ readers may not be famil- 
iar with is that this source code is writ- 
ten in a programming language called 
WEB. But when I say “written,” I real- 
ly do mean written: the source code, 
and the written description of it, are one 
and the same. WEB is a language, quite 
similar to Pascal (there’s also CWEB, 
which is quite similar to C), which 
makes it possible to merge the exe- 
cutable source for a system with its de- 
scription. More importantly, it allows 


Andrew Schulman 


you to construct and present the source 
in an order which makes “psychologi- 
cal” sense. Using such a system, soft- 
ware “authors” really do become au- 
thors, concerned with writing and 
presenting code in a way that makes 
sense, not so much to the compiler, but 
to the reader. 

Even if you’re not interested in type- 
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setting or typography, take a look some 
time at Knuth’s TeX: The Program (Vol- 
ume B of Computers and Typesetting) 
and METAFONT: The Program (Volume 
D). You'll not find anywhere else such 
detailed presentations of the entire 
source code—warts and all—for a large 
program. It’s instructive to consider what 
life would be like if the source code for 
your favorite system were available as 
a WEB. 

What inspired Knuth to issue these 
600-page hardcover books of source 
code? Literate Programming, a recent- 
ly issued collection of essays by Knuth 
and others from 1974 to 1989, contains 
a fascinating answer: 


Tony Hoare provided a special impetus 
for WEB when he suggested in 1978 that 
I should publish my program for TeX. 
Since very few large-scale software sys- 
tems were documented in the literature, 
he had been trying to promote the pub- 
lication of well-written programs. Hoare’s 
suggestion was actually rather terrifying 
to me, and I’m sure he knew that he was 
posing quite a challenge. As a professor 
of computer science, I was quite com- 
fortable publishing papers about toy 
problems that could be polished up nice- 
ly and presented in an elegant manner; 
but I had no idea how to take a piece of 
real software, with all the compromises 
necessary to make it useful to a large 
class of people on a wide variety of sys- 





tems, and to open it up to public scruti- 
ny. How could a supposedly respectable 
academic, like me, reveal the way he ac- 
tually writes large programs? 


This same challenge faces anyone 
who has ever tried to write about soft- 
ware: Only small programs seem ex- 
plicable in the course of an article or 
even a reasonably sized book. Yet gen- 
uine software tends not to be small, and 
generally seems to consist mostly of ug- 
ly distractions from whatever major 
points you’re trying to make. Small pro- 
grams are toys, and genuine programs 
seem impossible to explain in depth. Of 
course, one could present only a top- 
down view of a large program, ignor- 
ing the mass of details; but it is in these 
details that the program’s true worth 
(certainly its monetary worth!) probably 
resides. 

Knuth tackles this problem with the 
idea of programs as webs: 


When I first began to work with the ideas 
that eventually became the WEB system, 
I thought that I would be designing a lan- 
guage for “top-down” programming, 
where a top-level description is given first 
and successively refined. On the other 
hand I knew that I often created major 
parts of programs in a “bottom-up” fash- 
ion, starting with the definitions of basic 
procedures and data structures and grad- 
ually building more and more powerful 
subroutines. I had the feeling the top- 
down and bottom-up were opposing 
methodologies: one more suitable for 
program exposition and the other more 
suitable for program creation. 

But after gaining experience with WEB, 
I have come to realize that there is no 
need to choose once and for all between 
top-down and bottom-up because a pro- 
gram is best thought of as a web instead 
of as a tree.... 

When I’m writing a longish program 
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... Linvariably have strong feelings about 
what part of the whole should be tack- 
led next. For example, I'll come to a point 
where I need to define a major data struc- 
ture and its conventions, before I'll feel 
happy about going further. My experi- 
ences have led me to believe that a per- 
son reading a program is, likewise, ready 
to comprehend it by learning its various 
parts in approximately the order in which 
it was written.... Sometimes the “correct” 
order is top-down, sometimes it is bot- 
tom-up, and sometimes it’s a mixture; but 
always it’s an order that makes sense on 
expository grounds. 

Thus the WEB language allows a per- 
son to express programs in a “stream of 
consciousness” order.... the fact that 
there’s no need to be hung up on the 
question of top-down versus bottom- 
up—since a programmer can now view 
a large program as a web, to be explored 
in a psychologically correct order—is 
perhaps the greatest lesson I have learned 
from my recent experiences. 


In other words, there’s a way to pre- 
sent genuine, large programs, give the 
reader an understanding of how the en- 
tire system fits together, and still not 
brush aside messy issues like error re- 
covery, special cases, system depen- 
dencies, tweaks for performance, hacks, 
kludges, and all the other seemingly 
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nonalgorithmic issues that make up the 
bulk of a genuine program. Such de- 
tails are crucial to one’s understanding. 
They can't be “black boxed.” 

This jumping around from top-down 
to bottom-up and back to top-down isn’t 
just a matter of how source code gets 


Literate 
Programming brings 
back what a 
pleasure reading 
source code can be 


presented, either. It cuts to the root of 
programming itself. As Knuth notes in 
an amazing essay from 1974 in this col- 
lection (“Structured Programming with 
go to Statements”): 


I have felt for a long time that a talent 
for programming consists largely of the 
ability to switch readily from microscop- 
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ic to macroscopic views of things, i.e., to 
change levels of abstraction fluently. 


The name WEB of course comes 
straight from this notion of a program 
as a tangle of high-level and low-level 
issues. 

Above all, what comes through here 
is the notion of writing a program as 
writing. TeX, METAFONT, and WEB all 
fit together as part of a grand attempt 
(apparently partially inspired by Arthur 
Koestler’s Ghost in the Machine) to 
break down the division between doc- 
uments and programs, or at least be- 
tween documentation and programs. 

One of the key points is the need for 
a program and its documentation to be 
written by the same people. This sounds 
like an impossible luxury, but in many 
cases the awful state of documenta- 
tion—and the inexplicable misfeatures 
of a program—are the result of the 
“doc” group not knowing the program 
they have been hired to describe, and 
the programmers not thinking about 
how one would actually describe or use 
the program. 

What is missing is “the discipline of 
simultaneously writing and describing 
a program.” “Manual writing provides 
an ideal incentive for system improve- 
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ments, because you discover and re- 
move glitches that you can’t justify in 
print.” Basically, if you can’t explain it 
in a nonembarrassing way, then it 
shouldn’t be in the product. “The de- 
signer of a new system must not only 
be the implementor and the first large- 
scale user; the designer should also write 
the first user manual.” This is just a vari- 
ation on the age-old theme that the best 
way to learn is to teach, that the best 
way to understand something is to have 
to explain it. Again, we see that writing 
software means writing. 

Of course, where there is writing, 
there must be criticism. Not so much 
“criticism” in the sense of a “this is good, 
this is bad” review, but what in literary 
criticism is called “close reading.” So it 
is fitting that one chapter of Literate 
Programming, a reprint from a column 
by Jon Bentley on WEB, contains first 
a program by Knuth and then a detailed 
criticism by Doug McIlroy (who ends 
up calling Knuth’s sample program “a 
sort of industrial-strength Fabergé egg”). 

One of the most fascinating parts of 
Literate Programming is a 100-page sec- 
tion called “The Errors of TeX,” which 
“describes the milieu of literate pro- 
gramming, by tracing the history of all 
changes made to TeX as that system 
evolved.” This includes a complete er- 
ror log for TeX from 1978 to 1991. In- 
terestingly, 


C' Language 





#52 


Q: Is it safe to use calloc’s zero-fill 
guarantee for pointer and floating- 
point values? 


A: calloc(m, n) is essentially equivalent 
to p = malloc(m * n); memset(p, 0, 
m *n);. The zero fill is all-bits-zero, 
and does not therefore guarantee 
useful zero values for pointers or 
floating-point values. 


#54 


Q: [heard that structures could be as- 
signed to variables and passed to and 
from functions, but K&R I says not. 


A: These operations are supported by all 
modern compilers. 


Copyright © 1992 Steve Summit 
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...if you ask me whether keeping such a 
log has helped me learn how to reduce the 
number of future errors, my answer has to 
be No. I kept a similar log for errors in 
METAFONT, and there was no perceivable 
reduction. I continue to make the same kinds 
of mistakes. 


Oh well. 

It seems unfortunate that so much of 
this book focuses on Pascal. However, 
Chapter 12 presents the word count 
(wc) program from UNIX, rewritten in 
CWEB to demonstrate “literate pro- 
gramming” in C. While wc is, of course, 
a very simple program, Knuth notes that 
his “fondest hope is that readers who 
look at Chapter 12 will realize how won- 


COMPUTER 


LANGUAGE 





#include <stdlib.h> 


struct list { struct list *next; 


void freeup() 
{ 
Struct List. *p; 
for( p = head; p; 
free (p); 


p = p->next ) 


derful it would be if the entire UNIX 
system and its successors were written 
in the style of Chapter 12 or something 
similar.” 

Actually, it also made me wish that 
volumes 1—3 of the A7t of Computer Pro- 
gramming had been written in the style 
of Chapter 12 or something similar, or 
at least in the style of something other 
than the MIX assembly language. At any 
rate, Literate Programming brings back 
what a pleasure reading source code 
can be. 


DDJ 


Vote for your favorite feature/article. 
Circle Reader Service No. 14. 


5.0 presents 
C Bug # 644 


ass 


} *head = NULL; 


SRR RE 
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There’s something amiss with this function even though it will "work" most of 
the time. Can you or your compiler spot the problem? Call if you need a hint. 


Refer to Bug # 644. 


PC-lint will catch this and many other 
C bugs. Unlike your compiler, PC-lint 
looks across all modules of your 
application for bugs and inconsistencies. 


New — Optional Strong Type Checking 
and variables possibly not initialized. 


More than 330 error messages. More 
than 105 options for complete 
customization. Suppress error messages, 
locally or globally, by symbol name, by 
message number, by filename, etc. 
Check for portability problems. Alter 
size of scalars. Adjust format of error 
messages. Automatically generate ANSI 
prototypes for your K&R functions. 


Attn: Power users with huge programs. 


PC-lint 386 uses DOS Extender 
Technology to access the full storage 
and flat model speed of your 386. Now 
fully compatible with Windows 3.0 
and DOS 5.0 


PC-lint 386 — $239 
PC-lint DOS - OS/2 — $139 





Girnoe! Software 


3207 Hogarth Lane, Collegeville, PA 19426 


CALL TODAY (215) 584-4261 Or FAX (215) 584-4266 
30 Day Money-back Guarantee. 


PA add 6% sales tax. 


PC-lint and FlexeLint are trademarks of Gimpel Software 
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Call Greg Messina today at (415) 358-9500 
to develop a custom advertising campaign 
tailored to your marketing needs. 


AIB-PCt 
Advanced Interface Board for IBM-PC 
and Compatables 
Works with XT, AT, 386, 486 
using the ISA bus. 
ANALOG: 8 software selectable analog inputs 
12 bit ADC (8 microsecond conv. time) 
6 software selectable input ranges 
Dual 12 bit DAC (-5v to +5v) 
DIGITAL: two 8 bit ports for digital I/O 
TIMER: three 16 bit programmable timers 
price: $260.00 
Sunset Laboratory (503) 357-5151 
2017 19th Ave. Forest Grove, OR 97116 
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Converts FORTRAN to C++! 


C++ is ideal for object-oriented solutions, high level simulations 
and robust code! 


For_C++ generates full C++ prototypes while optimizing code for 
readability and ease of maintenance. |/O and character trans- 
lations use standard functions, where possible, and function 
arguments are passed either by value or reference. 


Take Advantage of C++ Power 


C++ offers superior handling of complex data types, enhanced 
support for call by reference, and retains more FORTRAN style 
than in C. While FOR_C++ does not object-orient your 
FORTRAN, it uses advanced C++ constructs for optimum trans- 
lations. Code looks hand-written, is compilable and ready to run. 


Call today for more information! 


PS. Ask about FOR_C 
and FOR_STRUCT—our 
other code translation 
and maintenance tools. 


COBALT BLUE 


COBALT BLUE, INC. 

875 OLD ROSWELL ROAD 
SUITE D-400 

ROSWELL, GA 30076, USA 
TEL (404) 518-1116 

FAX (404) 640-1182 





CIRCLE NO. 852 ON READER SERVICE CARD 
160 








gm Read & write Intel or Motorola format or convert between each. 

m Compress & Decompress Types (1)BitPacking & (32773)PackBits RLL & (5)LZW. 
Add, Delete or Modify Tags or Tag values in any format . 

View or modify any byte anywhere in Hex or Decimal. 

@ View Data Stream compressed or uncompressed in Hex or Decimal. 

@ Convert to E.P.S 3.0 or straight PostScript file. 

™ Control the Halftone Frequency, Angle & Dot. 

m@ View and or modify color maps, tone maps and color curves. 

@ Adjust brightness, contrast and gamma. 

@ View and edit the E.P.S or straight PostScript file text. 

@ Print TIFF file reports to PostScript or LaserJet Printers (PCL 4) & (PCL 5). 
m@ TIFF Engineer is $149.00. 


m Complete PostScript writing and page design environment. 

@ Design tools (square, line, circle, arc, text,images, rulers,zoom,etc.). 

@ Parallel or serial communication (serial-full duplex for error communication). 
@ Send Postscript code in batch or single step mode. 

@ Set multiple breakpoints and view error codes in separate window. 

@ View and edit object list and data. 

@ View and Edit generated PostScript Code in separate window. 

m@ Output code as straight Postscript file or E.P.S. 

m 7/FF file support (subset of TIFF ENGINEER) control Frequency, Angle and Dot. 
m@ Spot color support - generate Knockouts and specify Trap control. 

m@ Produce Four Color (cymk) separations with GCR & UCR. 

m@ Supports PostScript Level 1 & Level 2 code generation. 

@ Script Engineer is $195.00. 


TIFF ENGINEER & SCRIPT ENGINEER requires 640x480, 800x600 or 1024x768 
microsoft compatible mouse, harddisk and DOS 2.11 or higher. 
PostScript is a registered trademark of Adobe Systems Incorporated. 
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Thompson Toolkit+ AWK Compiler 


More Power: over 100 UNIX compatible utilities, a command shell, and 
our ultra-powerful AWK Compiler make short work of many programming needs. 


More Features: our AWK Compiler is a real language with many new 
functions. Similar to C, but with searching, sorting, string manipulation and pat- 
tern matching built-in. Compiled AWK programs can call functions written in C, 
use extended expanded memory transparently, and be distributed royalty-free. 


More Fun: Fast, easy to learn and use. 30 day Money Back Guarantee. 


Thompson 

Automation 

For Complete Literature 
Call 1-800-944-0139 


Out of USA: Call 503-224-1639 or FAX 503-224-3230 
Or write: 5616 SW Jefferson / Portland, OR 97221 USA 
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F H EF Receive your own personal profile and narrative report on 
THE STRONG INTEREST INVENTORY 


The Strong is periodically revised to reflect social and occupational 
changes. In order to update the Strong profiles of Computer 
Programmers and Systems Analysts we invite Programmers and 
Analysts with at least 3 years experience to complete the Strong at 
no charge and receive a report comparing your interests to those of 
others. This profile is confidential and your name will not be used 
for any other purpose. For 60 years the Stanford University Press 
has overseen publication of the Strong Interest Inventory, the most 
respected and widely used interest inventory in the world. The 
Strong compares a person’s interests in occupations, leisure activi- 
ties, school subjects and types of people with the interests of people 
who are successfully employed in a wide variety of occupations. 
Please be sure to specify your profession—Computer Programmer 
or Systems Analyst on the RS card. 

To find out more, call (800) 526-0050 ext. 108. 
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James Oxley wrote his 
first PC game with 


™ 
Fastgraph 
you can too for only 
$169 
Unlock the secrets of DOS graphics programming with 


the graphics library commercial games programmers 
have been using for years. 


C ¢ C++ ¢ Pascal e Basic e FORTRAN 
Voice (702) 735-1980 


FAX (702) 735-4603 
BBS (702) 796-7134 


Demos available 










Ted Gruber Software 
PO Box 13408 
Las Vegas, NV 89112 


Visa/MC/COD No royalties 
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SuperHackers 


D. E. Shaw & Co. is a small (several dozen employees), highly 
capitalized (over $300 million in partners’ equity), extremely suc- 
cessful Wall Street firm specializing in quantitative finance and 
computational trading. Our technical staff includes both Ph.D.- 
level researchers recruited from Stanford, MIT, and other leading 
computer science departments and extraordinarily talented B.S.- 


and M.S.-level system designers and “superhackers”’. It is our 
practice to compensate unusually gifted individuals at a level 
exceeding that of the market. Applicants are invited to send 
resumes to the Recruiting Department. 


D. E. SHAW & Co. 


39TH FLOOR, TOWER 45 
120 W. 45TH STREET 
NEW YORK, NY 10036 
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¢- VERSION 3.0! - 


The Premier Worldwide Software Distribution System for 
Windows ¢ DOS ¢ OS/2 and soon for PM, Unix and more! 


Thanks to our many friends around the 
world who helped us enhance INSTALIT to 
version 3.0. Yes, we put almost all of your 
wishes in 3.0! Although the INSTALIT script 
language is now more complete and 
powerful than ever, anyone can use 


INSTALIT in scriptless mode for any size of 
application. 

Our annual maintenance customers already 

have their automatic upgrade. If you are 

not an INSTALIT shop, why not join us now 

as we move together to version 

4.0 and beyond? 


30-day FREE trial with a 90 day money back guarantee 
BBS Demos (205) 880-8785 VISA-MC-AMEX-PO 
DOS-OS/2 $149 * Windows $199 
(Specify national language) 
Source and multiple translations available 


P.O. Box 16078 Huntsville, AL 35802 U.S.A. 


Voice/Fax: (205) 880-8782 
Orders: (800) 448-4154 


Germany, Austria, Switzerland (49) 07-2361500 
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UNIX/MOTIF 
COMPLETE DEVELOPMENT WORKSTATION 


witeo$ 3,750.00 ae 


& TESTED 


INTERACTIVE V3.0 WORKSTATION & MOTIF DEVELOPER 


- 386/33 MHZ CPU 
- 8 MB RAM 


“14 coLonsvea (rozexa6ey| HARDVARE ONL 
- 1.2 & 1.44 MB FLOPPIES 
- ENHANCED KEYBOARD 
- ANSI C - 2 SERIAL; 1 PARALLEL PORTS $1,700.00 
- LOOKING GLASS - 16 BIT ETHERNET ADPTR 
- VPIX - MOUSE 
ALL OTHER CONFIGURATIONS AVAILABLE - CALL FOR PRICING 
Application & Network Strategies, Inc, BRAM ae ara Aa asa a 


iverside Ave, Sui the 
eee (800)959-2070 
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BGI Printer Driver Toolkit - 8G! printer drivers for Borland's BGI graphics f 
library. Epson/IBM 9 pin, Epson 24 pin, LaserJet, DeskJet, PaintJet, Postscript, f 
others. -Not a screen dump - load our drivers with BGI's initgraph and get full f 
hardcopy device resolution. Supports TC, TC++, BC++, Turbo Pascal. $89.95 


BGI For Windows - BG! compatible interface to Windows 3.x GDI. Port your f 
Borland DOS BGI graphics routines effortlessly to Windows. Full stroke font, 256 § 
color, and high resolution hardcopy support. Supports TCW, BC++, TPW. $89.95 





PC Timer Tools - Microsecond resolution timing, delays, interrupt profiling, 
asynchronous thread scheduler, and timer tick interrupt management. Supports TC, 
TC++, BC++, MSC, Intel 386 Code Builder, Zortech, Turbo Pascal. $69.95 


PC Timer Objects - OOP implementation of PC Timer Tools. High resolution | 
timing, delays, asynchronous thread scheduler, timer tick interrupt management. 
Supports TC++, BC++, MSC++, Zortech C++, Turbo Pascal Objects. $69.95 
All Toolkits Include Full Source & Object Code Distribution License. 
VISA & MasterCard Accepted. Add $4.00 Shipping USA, $7.00 Elsewhere. 
Our 30 Day "No Questions Asked" Return Policy Guarantees Satisfaction. 
Ryle Design PO Box 22, Mt. Pleasant, Michigan 48804 USA 


Voice/Fax: 517.773.0587 BBS: 517.772.2393 C\iS: 73047,1765 
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Would you know one 
if you saw one? 





















Protect the integrity of your C-language programs with C-Verify™ 
and C-Debug;” two utilities from Softran™ 

C-Verify, Softran’s newest product, is a coverage analyzer for 
testing C-language programs. It generates a report of every branch 
taken—and not taken—through your program, assuring you of 100 
percent test coverage! 

C-Debug is a pointer checker which, according to Unix Review, 
“excells at finding pointer problems in code.” When used with 
C-Verify, it offers you the highest quality assurance possible in 
C-language programming. C-Verify and C-Debug can be run at the 
same time and work with all C compilers. Both are available for MS- 
DOS and UNIX systems. To order, call 1-800-462-3932. (In Canada, 
call 1-708-505-3456. ) 


Softran 


CORPORATION 






Softran Corporation 
One Naperville Plaza, Suite 455 
Naperville, IL USA 60563 
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The Measure of 
a Great Program. 


PC-METRIC’: 
The Measurement Tool For Serious Developers. 


Break Into Windows the Easy way! 


Flare} 


The New Windows Programming Language . 








PC-METRIC is the software measurement 
tool that measures your code and identi- 
fies its most complex parts so you can 
spend your time working in the areas 
most likely to cause problems. 


PC-METRIC is fast, user-configurable, 
and includes a wide array of commonly 
accepted measurement standards. 


Plus, versions of PC-METRIC are avail- 
able to support virtually every popular 
programming language. 





were source neee works at a higher Peel and is much easier 
to write, read, and maintain . .. strikes a nice balance between 
power and simplicity... i 

Jeff Duntemann, Editor in Chief, PC TECHNIQUES 


"Liana's strength is that it allows C and C++ programmers to 
begin writing Windows programs in very short order." . 
Ron Burk, Editor, Windows/DOS Developer’ sJ ournal - 
Interpretive # generates .EXE files # syntax similar to C and C++ @ less error- 
prone # single inheritance OOP # 90 high-level classes + DLL # DDE # MDI # 
string operations # automatic memory management + don't have to declare 


variables # no need for header or MAKE files # screen capture utility # over 60 
fully-documented sample programs on-line manual + free monthly newsletter. 


Buy the low cost Personal Developer for only $129 
Or the royalty-free Professional Developer for $495 


Call 1-800-786-9505 Now! 


(or e-mail Compuserve [70642,2662] ) 









A Great Value By Any Measure. 
PC-METRIC’s price is only $199, 
and it comes with a 30-day money- 
back guarantee. Multiple user dis- 
counts are available, as well as site 
licenses and complete source code. 


Order Now! Call (503) 829-7123 








SET LABORATORIES, INC 


COMPUTER “Quality Tools For Software vata 
LANGUAGE P.O. Box 868 


and ask us your toughest technical questions! 
PRODUCTIVITY _ a Mulino, OR 97042 7 
Phone: (503) 829-7123 


ee Base Technology 1543 Pn St, Boulder, CO 80302 
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vx Base ~ 


Take Control! 


Visual Basic + vxBase Dynamic Link Library = 
xBase apps for Windows in hours. Functions 
vxAppendBlank thru vxZap (110 in all). 


Incredible browse object supports user tables, 
onscreen editing, visual relationships, user 





for | 


menus. ian | |_p es 
Clipper ntx and memo file compatible. Full : ara al 
multi-user, multi-tasking. Memo editor. 3d ‘tt Memo, MDX OQ Twice the e Speed & & 


OC Libraries for Paradox 
| _O Windows(DDL)and 
VisualBasic Support - 


O Multi-Platform/ 
C Source Available 


| tO Fall Network i Suppprt : 


Menus, dialogs, messages in English, French, 
German, Spanish, and Italian! 


No royalties. Sixty day money back guarantee! 
DataWorks xBase tool included along with 244 
page manual. 


Regular $295 U.S. 
Introductory offer $195 + $15 S&H. Visa & M/C 


vxBase #200, 10310 - 176 Street 
Edmonton, AB, Canada TSS 1L3 
Phone (403) 489-5994 
Fax(403) 486-4335 







0 TI HNCAL DETAILS 708/924. 3030 Doc. ‘NO. 989823 


‘opia International Ltd. 
Wheatgn. | linois 60187 708 /682- B898 
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STRUCTURED PROGRAMMING 


Listing One (Text begins on page 141.) 


a i ec ca a haa a are } 
{ MORTGAGE J 
{ By Jeff Duntemann -- From DDJ for August 1992 --Last Updated 5/2/92 } 
{ Major update: 3/25/92: Added all the rigmarole to make the TMortgage type } 
{ streamable. It now descends from TObject and uses the Objects unit. I } 
{ also added the registration record and the Load and Store methods. } 
ewww nnn wen wee ne ee = } 


UNIT Mortgage; 
INTERFACE 


USES Objects; 


TYPE 

Payment = RECORD { One element in the amort. table. } 
PayPrincipal : Real; 
PayInterest : Real; 
PrincipalSoFar : Real; 
InterestSoFar : Real; 
ExtraPrincipal : Real; 
Balance : Real; 

END ; 
PaymentArray ARRAY[1..2] OF Payment; { Dynamic array! } 


PaymentPointer = “PaymentArray; 


PMortgage = “TMortgage; 


TMortgage = 
OBJECT (TObject) ‘ Must descend from TObject to be streamable } 
Periods : Integer; { Number of periods in mortgage } 
PeriodsPerYear : Integer; { Number of periods in a year } 
Principal : Real; { Amount of principal in cents } 
Interest : Real; { Percentage of interest per *YEAR*} 


MonthlyPI =: Real; { Monthly payment in cents } 
Payments : PaymentPointer; { Array holding payments J 
PaymentSize : LongInt; { Size in bytes of payments array } 


CONSTRUCTOR Init (StartPrincipal : Real; 
StartInterest : Real; 
StartPeriods : Integer; 


StartPeriodsPerYear : Integer) ; 
CONSTRUCTOR Load(VAR S : TStream) ; 
PROCEDURE SetNewInterestRate(NewRate : Real); 
PROCEDURE Recalc; 
PROCEDURE GetPayment(PaymentNumber : Integer; 

VAR ThisPayment : Payment) ; 
PROCEDURE ApplyExtraPrincipal(PaymentNumber : Integer; 
Extra : Real); 

PROCEDURE RemoveExtraPrincipal(PaymentNumber : Integer) ; 
PROCEDURE Store(VAR S : TStream) ; 
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DESTRUCTOR Done; VIRTUAL; 
END; 
CONST 
RMortgage : TStreamRec = 
(ObjType : 1280; 
VMTLink : Ofs(TypeOf (TMortgage) *) ; 


Load : @TMortgage.Load; 
Store : @TMortgage.Store) ; 
IMPLEMENTATION 
FUNCTION CalcPayment (Principal,InterestPerPeriod : Real; 
NumberOfPeriods : Integer) : Real; 
VAR 
Factor : Real; 
BEGIN 


Factor := EXP(-NumberOfPeriods * LN(1.0 + InterestPerPeriod)) ; 
CalcPayment := Principal * InterestPerPeriod / (1.0 - Factor) 
END ; : 


CONSTRUCTOR TMortgage.Init (StartPrincipal : Real; 
StartInterest : Real; 
StartPeriods : Integer; 
StartPeriodsPerYear : Integer); 
VAR 
I: Integer; 
InterestPerPeriod : Real; 
BEGIN 
{ Set up all the initial state values: } 
Principal := StartPrincipal; 
Interest := StartInterest; 
Periods := StartPeriods; 
PeriodsPerYear := StartPeriodsPerYear; 


{ Here we calculate the size that the payment array will occupy. } 
{ We retain this because the number of payments may change...and } 
{ we'll need to dispose of the array when the object is ditched: } 
PaymentSize := SizeOf(Payment) * Periods; 


{ Allocate payment array on the heap: } 
GetMem(Payments, PaymentSize) ; 


{ Initialize extra principal fields of payment array: } 
FOR I := 1 TO Periods DO 

Payments*[I].ExtraPrincipal := 0; 
Recalc; { Calculate the amortization table } 


END ; 
CONSTRUCTOR TMortgage.Load(VAR S : TStream) ; 
BEGIN 
S.Read(Periods, Sizeof (Integer) ) ; 
S.Read(PeriodsPerYear ,SizeOf (Integer) ) ; 
§.Read(Principal, SizeOf (Real) ); 
S.Read(Interest, SizeOf (Real) ) ; 
S.Read(MonthlyPI, SizeOf (Real) ) ; 
S.Read(PaymentSize, SizeOf(LongInt)) ; 
{ Note that we *don't* try to read a pointer in from the stream. That would } 
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{ be meaningless; instead, we allocate heap space for the payments array 
{ with GetMem and assign the returned pointer to Payments: } 
GetMem (Payments, PaymentSize) ; 


S.Read(Payments”, PaymentSize) ; 
END; 
PROCEDURE TMortgage.Store(VAR S : TStream) ; 
BEGIN 


S.Write (Periods, Sizeof (Integer) ); 

S.Write (PeriodsPerYear,SizeOf (Integer) ); 

S.Write (Principal, SizeOf (Real) ) ; 

S.Write(Interest, SizeOf (Real) ) ; 

S.Write(MonthlyPI, SizeOf (Real) ) ; 

{ Note that we *don't* store the pointer to the payments array! } 
{ A pointer (i.e., a heap address) is meaningless written to disk.} 
S.Write(PaymentSize, SizeOf(LongInt)); 


S.Write(Payments”’, PaymentSize) ; 
END ; 
PROCEDURE TMortgage.SetNewInterestRate(NewRate : Real); 
BEGIN 

Interest := NewRate; 

Recalc; 
END ; 
{ This method calculates the amortization table for the mortgage. } 
{ The table is stored in the array pointed to by Payments. } 
PROCEDURE TMortgage.Recalc; 
VAR 

I: Integer; 

RemainingPrincipal : Real; 

PaymentCount : Integer; 

InterestThisPeriod : Real; 

InterestPerPeriod : Real; 

HypotheticalPrincipal : Real; 
BEGIN 

InterestPerPeriod := Interest/PeriodsPerYear; 

MonthlyPI := CalcPayment (Principal, 

InterestPerPeriod, 
Periods) ; 


{ Round the monthly to cents: } 
MonthlyPI := int(MonthlyPI * 100.0 + 0.5) / 100.0; 


{ Now generate the amortization table: } 
RemainingPrincipal := Principal; 
PaymentCount := @; 
FOR I := 1 TO Periods DO 
BEGIN 
Inc (PaymentCount) ; 
{ Calculate the interest this period and round it to cents: } 
InterestThisPeriod := 
Int ((RemainingPrincipal * InterestPerPeriod) * 100 + 0.5) / 100.0; 
{ Store values into payments array: } 
WITH Payments’ [PaymentCount] DO 
BEGIN 
IF RemainingPrincipal = @ THEN { Loan's been paid off! } 
BEGIN 
PayInterest := Q; 
PayPrincipal := 0; 
Balance := @; 
END 
ELSE 
BEGIN 
HypotheticalPrincipal := 
MonthlyPI - InterestThisPeriod + ExtraPrincipal; 
IF HypotheticalPrincipal >» RemainingPrincipal THEN 
PayPrincipal := RemainingPrincipal 
ELSE 
PayPrincipal := HypotheticalPrincipal; 
PayInterest := InterestThisPeriod; 
RemainingPrincipal := 
RemainingPrincipal - PayPrincipal; { Update running balance } 
Balance := RemainingPrincipal; 
END ; 
{ Update the cumulative interest and principal fields: } 
IF PaymentCount = 1 THEN 


BEGIN 
PrincipalSoFar := PayPrincipal; 
InterestSoFar := PayInterest; 
END 
ELSE 
BEGIN 
PrincipalSoFar := 
Payments” [PaymentCount-1].PrincipalSoFar + PayPrincipal; 
InterestSoFar := 
Payments’ [PaymentCount-1].InterestSoFar + PayInterest; 
END ; 
END; { WITH } 
END ; { FOR } 
END ; { TMortgage.Recalc } 
PROCEDURE TMortgage.GetPayment (PaymentNumber : Integer; 
VAR ThisPayment : Payment) ; 
BEGIN 
ThisPayment := Payments*[PaymentNumber] ; 
END ; 
PROCEDURE TMortgage.ApplyExtraPrincipal(PaymentNumber : Integer; 
Extra : Real); 
BEGIN 
Payments” [PaymentNumber] .ExtraPrincipal := Extra; 
Recalc; 
END ; 


PROCEDURE TMortgage.RemoveExtraPrincipal (PaymentNumber : Integer) ; 
BEGIN 
Payments” [PaymentNumber] .ExtraPrincipal := 0.0; 
Recalc; 
END ; 
DESTRUCTOR TMortgage.Done; 
BEGIN 
FreeMem(Payments , PaymentSize) ; 
END ; 
END. { MORTGAGE } 


End Listing 
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Innovative Developer's Tools! 


TE Developer's Kit Updated 


Incorporate text editing features into your 
application easily and cost effectively. Features: 
multiple files/windows, word-wrap, edits large files, 
reconfigurable keyboard and screen colors, undo, 
cut/paste, printing, and search/replace. The kit also 
includes TER small editor routine. TER is a subset 
of TE. Your application calls this routine by 
specifying the window location and size, maximum 
file size, and an input buffer or file. Supports 
Microsoft and Borland 'C' compilers. Includes the 
complete source code. (Version: 3.0, DOS: $169, 
Windows: $269) 


Also available version 3.5 that incorporates 
additional word processing features such as bold, 
underline, and italic formats. Paragraph features: 
indentation, double spacing, centering, and right 
justification. The Windows version also includes 
imbedded pictures, multiple fonts and point sizes, 
and many more character styles.(Version 3.5, DOS: 
$249, Windows: $349) 


Report Ease: Report Writer/Mail Merge Engine 


Report Ease consists of a form editor and a 
report executer. Advanced features include: multiple 
files, multiple sorts; data, calculation, system, and 
dialog fields; bold, underline, italic formats; subtotals, 
record filter, functions, word wrapping and more. 
Simple interface works with any application. Includes 
the 'C' source code. (DOS: $349, Windows: $379) 


Spell Time 


Spell Time consists of a dictionary (over 100K 
words) and the routines to access the dictionary. It also 
includes an application specific dictionary, and a user 
dictionary. The routine will suggest alternatives for a 
misspelled word. Highly optimized: 75 to 100 
words/sec on a 12 Mhz 286 computer. An interface 
with TE included. Includes the complete 'C' source 
code. Specify DOS or Windows. ($369) 


Oliitca waleiian 


MultiSpawn: MultiSpawn offers multiple 
methods of running large programs within other programs. 
It returns to the original program after completing the new 
program or by a hot key interruption. Royalty Free. ($149) 


WinMem: WinMem provides an unlimited 
number of global memory handles. It provides virtual 
memory and reduces the overhead per object to 4 bytes. 
Includes source code. ($139) 


Sub Systems, Inc. 
1-800-447-6819 


Fax: 1-617-438-0311 
159 Main Street, #8C, Stoneham, MA 02180, (617)-438-8901 
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GRAPHICS PROGRAMMING 


ADVANCED Listing One (Text begins on page 149.) 


| | Hize@els |"e— rect 
Through i EGP NWA 


ao 


typedef struct _ModelColor 


unsigned char Red; /* 255 = max red, @ = no red */ 





: wy DEVE unsigned char Green; /* 255 = max green, @ = no green */ 
a ‘step by — LOPE = unsigned char Blue; /* 255 = max blue, @ = no blue */ 
} ModelColor; 


process,’ 
subscribers A monthly “how-to” newsletter End Listing One 
for developers working with: 


will learn how Listing Two 


to make these . 
: oe /* Sets up the palette in mode X, to a 2-2-2 general R-G-B organization, with 
technologies 64 separate levels each of pure red, green, and blue. This is very good 


work for them. NEURAL NETWORKS for pure colors, but mediocre at best for mixes. 


'®@ @ | Red;Green; Blue ; 


EXPERT SYSTEMS aera aa a 


GENETIC ALGORITHMS ti Red} 
Jane FUZZY LOGIC |  cereeeeeeeeee eee 


Klimasauskas oo } © ff S222 ZESSE ow! 


Edge (its AND MORE! 76543210 


bhedudtory Offer! 


$99.00/year (Domestic) sscheors ideale 
$179.00/year USD (International) Hanclude "poly gon.h" 


static unsigned char Gamma4Levels[] = { @, 39, 53, 63 }; 


HIGH-TECH COMMUNICATIONS static unsigned char Gamma64Levels[] = { 


@, 10, 14, 17, 19, 21, 23, 24. 26, 27, 28,. 29, 31, 32, 33,34; 


Colors are gamma corrected for a gamma of 2.3 to provide approximately 
even intensity steps on the screen. */ 


hy 103 Buckskin Court 35, 36, 37, 37, 38, 39, 40, 41, 41, 42, 43, 44, 44, 45, 46, 46, 

47, 48, 48, 49, 49, 50, 51, 51, 52, 52, 53, 53, 54, 54, 55, 55, 

Sewickley, PA 15143 USA 56, 56, 57, 57, 58, 58, 59, 59, 60, 60, 61, 61, 62, 62, 63, 63, 
Phone: 412 741 7699 My 

Fax: 412 741 6094 static unsigned char PaletteBlock[256] [3]; /* 256 RGB entries */ 





void InitializePalette() 
CIRCLE NO. 730 ON READER SERVICE CARD 
int Red, Green, Blue, Index; 
union REGS regset; 
struct SREGS sregset; 


for (Red=@; Red<4; Redt++) { 
for (Green=@; Green<4; Greent+) { 
for (Blue=@; Blue<4; Bluet+) { 
Index = (Red<<4)+(Green<<2)+Blue; 
PaletteBlock [Index] [@] = Gamma4Levels [Red] ; 
PaletteBlock [Index] [1] Gamma4Levels [Green] ; 


Do you need a team of 


e e e PaletteBlock [Index] [2] = Gamma4Levels [Blue] ; 
Windows®, Unix® or Macintosh® || 
software development experts? || 222.2829 { cmsensen: 
PaletteBlock[64+Red] [1] = 9; 
PaletteBlock[64+Red] [2] = @; 


for (Green=@0; Green<64; Greent+) f 
PaletteBlock[128+Green] [@] = @; 


' i i PaletteBlock[128+Green] [1] = Gamma64Levels [Green] ; 
sie oe eae , ae vn ; PaletteBlock|1oe+reen! [2] =@; 
software developmen acintosh. 
firm with an 8 year We've developed for (Blue=0; Blue<64; Bluet+) { 
. fo PaletteBlock[192+Blue] [0] = @; 
history of success. Our J many award-winning PaletteBlock[192+Blue] [1] = 0; 


PaletteBlock[192+Blue] [2] Gamma64Levels [Blue] ; 


large, experienced products for our clients. ) 
software engineering But don't take our 


/* Now set up the palette */ 


staff is available for word for it. send regset.x.ax = Ox1@12; /* set block of DAC registers function */ 
: regset.x.bx = Q; /* first DAC location to load */ 
new product design for our portfolio regset.x.cx = 256; /* # of DAC locations to load */ 
regset.x.dx = (unsigned int)PaletteBlock; /* offset of array from which 
and development, and our free to load RGB settings */ 
: - sregset.es = _DS; /* segment of array from which to load settings */ 
conversions and booklet today, int86x(@x10, &regset, Gregset, &sregset); /* load the palette block */ 
} 
special projects. How to Select ng 
poe End Listing Two 
Our expertise and Evaluate ‘eitina th, 
Listing Three 
includes Windows, Software Develop- 
ment Companies. /* Converts a model color (a color in the RGB color cube, in the current 


color model) to a color index for mode X. Pure primary colors are 
special-cased, and everything else is handled by a 2-2-2 model. */ 
int ModelColorToColorIndex(ModelColor * Color) 
{ 
if (Color->Red == @) { 
if (Color->Green == @) { 


230 Western Ave., Boston, MA 02134 oe, eke 
617-782-4877 
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} else if (Color->Blue == @) { 
/* Pure green */ 
return(128+(Color->Green >> 2)); 


} 

} else if ((Color->Green == @) && (Color->Blue == @)) { 
/* Pure red */ 
return(64+(Color->Red >> 2)); 


/* Multi-color mix; look up the index with the two most significant bits 
of each color component */ 
return(((Color->Red & @xC®@) >> 2) | ((Color->Green & @xC®) >> 4) | 
((Color->Blue & @xC®@) >> 6)); 


End Listing Three 
Listing Four 


; Demonstrates drawing solid text on the VGA, using The BitMan's write mode 
; 3-based, one-pass technique. Tested with TASM 3.0 and MASM 5.1. 


CHAR_HEIGHT equ 8 ;# of scan lines per character (must be <256) 
SCREEN_HEIGHT equ 480 ;# of scan lines per screen 

SCREEN_SEGMENT equ @a®@@@h ;where screen memory is 

FG_COLOR equ 14 ;text color 

BG_COLOR equ 1 ;background box color 

GC_ INDEX equ 3ceh ;Graphics Controller (GC) Index reg I/O port 
SET_RESET equ@_ ;Set/Reset register index in GC 

G_MODE equ 5 ;Graphics Mode register index in GC 

BIT_MASK equ 8 ;Bit Mask register index in GC 


.model small 


.stack 200h 

.data 
Line dw ? ;current line # 
CharHeight dw ? ;# of scan lines in each character (must be <256) 
MaxLines dw ? ;max # of scan lines of text that will fit on screen 
LineWidthBytes dw ? ;offset from one scan line to the next 
FontPtr dd ? ;pointer to font with which to draw 


SampleString label byte 
db 'ABCDEFGHIJKLMNOPQRSTUVWXYZ' 
db ‘abcdefghijklmnopqrstuvwxyz' 
db '@123456789!@#S%*&*() ,<.>/73:',O 


.code 

start: 
mov ax,@data 
mov ds,ax 


mov ax,12h 
int 10h ;select 64@x48@ 16-color mode 


mov ah,iih ;BIOS character generator function 
mov al,3@h ;BIOS get font pointer subfunction 
mov bh,3 ;get 8x8 ROM font subsubfunction 
int 10h ;get the pointer to the BIOS 8x8 font 
mov word ptr [FontPtr] ,bp 

mov word ptr [FontPtr+2],es 


mov bx,CHAR_HEIGHT 

mov [CharHeight],bx ;# of scan lines per character 
mov ax,SCREEN_HEIGHT 

sub dx,dx 

div bx 

mul bx ;max # of full scan lines of text that 
mov [MaxLines] ,ax ; will fit on the screen 


mov ah,@fh ;BIOS video status function 
int 10h ;get # of columns (bytes) per row 
mov al,ah ;convert byte columns variable in 
sub ah,ah ; AH to word in AX 
mov [LineWidthBytes] ,ax ;width of scan line in bytes 
;now draw the text 


sub Dx, bz 
mov [Line],bx ;start at scan line @ 
LineLoop: 
sub ax,ax ;start at column @; must be a multiple of 8 


mov ch,FG_COLOR ;color in which to draw text 

mov cl,BG_COLOR ;color in which to draw background box 
mov si,offset SampleString ;text to draw 

call DrawTextString ;draw the sample text 

mov bx, [Line] 

add bx, [CharHeight] ;# of next scan line to draw on 
mov [Line] ,bx 

emp bx, [MaxLines] ;done yet? 

jb LineLoop snot yet 


mov ah,/ 
int 21h ;wait for a key press, without echo 


mov ax,@3h 
int 1@h ;back to text mode 


mov ah,4ch 
int 21h ;exit to DOS 


; Draws a text string. 
; Input: AX = X coordinate at which to draw upper left corner of first char 
Y coordinate at which to draw upper left corner of first char 
foreground (text) color 

CL = background (box) color 

DS:SI = pointer to string to draw, zero terminated 

CharHeight must be set to the height of each character 

FontPtr must be set to the font with which to draw 
; LineWidthBytes must be set to the scan line width in bytes 
; Don't count on any registers other than DS, SS, and SP being preserved. 
; The X coordinate is truncated to a multiple of 8. Characters are 
; assumed to be 8 pixels wide. 

align 2 


(continued on page 108) 
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Senior Software Test Engineer 


At Motorola original thinkers can create the way the world 
communicates. Become part of a high performance, high 
energy team working in our Wireless Enterprise Systems 
start-up environment. 


In this key role you willlead the development of test methods, 
tools and certifications for object oriented software in a 
personal communication platform. You'll also determine/ 
improve test coverage as well as develop and improve test 
metrics. 


You must have a BS/MS degree in Computer Science/ 
Computer Engineering, or 5-7 years’ equivalent experience. 
An understanding of object oriented software design; the 
ability to develop hardware and software diagnostics using 
Hypertalk/Hypercard; knowledge and experience develop- 


ing for the 68000 family of computer hardware; and the ability 
to use the Macintosh MPW development environment are 
essential. Programming languages should include Object-C, 
Object Pascal, Smalltalk or C++ as well as Assembly. Dem- 
onstrated performance on at least two software projects is 
required. Background in GUI is highly desired. 


We offer an excellent salary, a comprehensive benefits 
package and opportunities for professional growth. For im- 
mediate consideration, please send your resume to: Super- 
visor, Professional Recruiting, Dept. DD/DL, Motorola 
Inc., 1501 W. Shure Drive, Arlington Heights, IL 60004. Or, 
FAX your resume to our Resumix FAX line: (708) 632- 
7382. Motorola welcomes and encourages diversity in our 
workforce. We are an equal opportunity employer. 


(AA) MOTOROLA 


Uncompromising 


No Surveys. — 
No Interviews. 
Comm fe) (=e 


(well, sometimes...) 


The Users Journal” 
Simply the best tool for C programmers. Period. 


Yes—I want the best C information Now! 


1 year - $29.95 Mail to: 
Save 49% off our The C Users Journal 
newsstand price Suite 200 


1601 West 23rd Street 
(2) Payment enclosed [)Bill me — Lawrence, KS 66046 


Name 
Company 
Address 
City/State/Zip 
For Faster Service Call: 913-841-1631 ‘i’ 








Foreign subscribers: 1 yr. — $65 US 
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PROJECT 
DUE DATE: 
JUNE 7 





Object Oriented Programming without the right tools is SUICIDE! 








e True C++ dBase Library 
e Support for All Memory Models 


dClass was designed from the ground up as a true C++ class, 
not some old C functions with a class wrapped around them. 
dClass makes it easier for you to access data, and its object 
oriented design lets you create more robust programs with 
fewer errors. And isn't that why you moved to C++ in the first 
place? 


Call 1-800-771-1274 Fax 1-407-297-3566 
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GRAF/DRIVE PLUS 3.0 


Turbo Graphics for Printers, 
Plotters—and now DITP Files 


Publication-quality hard copy graphics for Turbo C & 
Pascal! Install our BGI printer drivers and your screen 
graphics program prints graphs too. Add our new DTP 
file formats and import the graphs to your desktop 
publisher or word processor. Great for user manuals! 


LaserJet, Epson, DeskJet, DeskJet 500C, PostScript, 
ProPrinterX24, PaintJet, HPGL plotters, and more. 
PCX, TIF, CGM, WPG, DXF, Video Show, color dot 
matrix printers, and more (Supplemental Driver Disk). 
Arc, bar, bar3d, circle, fillpoly, line, ellipse, pieslice, 
line/fill patterns, etc—all 100% interchangeable with 
Turbo’s screen graphics. Plot to LPT, COM, or disk. 
Use PostScript fonts and LJ soft fonts (including LJIII 
scalable fonts), plus Borland stroke fonts. 

Not a screen dump. You get the full printer resolution. 
Not TSR. Loads on the heap at runtime. EMS optional. 
Portrait and landscape. Control image size/location. 
Compatible with Graphics Menu, Zinc, Turbo Vision. 
"Highly recommended.” - Jeff Dunteman, Dr. Dobbs, 
11/90. Our customers include Harvard, UCLA, HP, 
TRW, and Los Alamos Labs. 


For Turbo C 2.0, C+ +, Pascal 5-6. Personal License 
$149, Developers with royalty-free distribution $299. 
Supplemental DTP drivers $99. 30-day m/b guarantee. 


FLEMING SOFTWARE BOX 569 OAKTON, VA 22124 
(703) 591-6451 
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DrawCharLoop: 


DrawTextDone: 





GRAPHICS PROGRAMMING 


Listing Four (Listing continued, text begins on page 149.) 


DrawlextString proc near 


cld 
shr ax,l ;byte address of starting X within scan line 
shr ax,1 
shr ax,1 
mov di,ax 
mov ax, [LineWidthBytes] 
mul bx ;start offset of initial scan line 
add di,ax ;start offset of initial byte 
mov , SCREEN_SEGMENT 
,ax ;ES:DI = offset of initial character's 
; first scan line 
;set up the VGA's hardware so that we can 
; fill the latches with the background color 
,GC_ INDEX 
,(@ffh SHL 8) + BIT_MASK 
ax ;set Bit Mask register to OxFF (that's the 
; default, but I'm doing this just to make sure 
; you understand that Bit Mask register and 
; CPU data are ANDed in write mode 3) 
,(@O@3h SHL 8) + G_MODE 
dx, ax ;select write mode 3 
ah,cl ;background color 
al, SET_RESET 
dx, ax ;set the drawing color to background color 
byte ptr es: [@ffffh] ,@ffh ;write 8 pixels of the background 
; color to unused offscreen memory 
cl,es: (@ffffh] ;read the background color back into the 
; latches; the latches are now filled with 
; the background color. The value in CL 
; doesn't matter, we just needed a target 
; for the read, so we could load the latches 
mov ah,ch ;foreground color 
out dx,ax ;set the Set/Reset (drawing) color to the 
; foreground color 
;we're ready to draw! 


DrawlextLoop: 


lodsb ;next character to draw 
and al,al ;end of string? 
jz DrawTextDone ;yes 
push ds ;remember string's segment 
push si ;remember offset of next character in string 
push di ;remember drawing offset 
;load these variables before we wipe out DS 
mov dx, [LineWidthBytes] ;offset from one line to next 
dec ;compensate for STOSB 
ex, [CharHeight] ; 
cl ;offset of character in font table 
lds si, [FontPtr] ;point to font table 
add si,ax ;point to start of character to draw 
;the following loop should be unrolled for 
; maximum performance! 
;draw all lines of the character 
movsb ;get the next byte of the character and draw 
; character; data is ANDed with Bit Mask 
; register to become bit mask, and selects 
; between latch (containing the background 
; color) and Set/Reset register (containing 
; foreground color) 
add di,dx ;point to next line of destination 
loop DrawCharLoop 


pop di ;retrieve initial drawing offset 

ine di ;drawing offset for next char 

pop si ;retrieve offset of next character in string 
pop ds ;retrieve string's segment 

jmp DrawTextLoop ;draw next character, if any 


align 2 
;restore the Graphics Mode register to its 
; default state of write mode @ 
mov dx,GC_INDEX 
mov ax,(@@@h SHL 8) + GMODE 
out dx,ax ;select write mode @ 
ret 


DrawTlextString endp 


end start 


End Listings 
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#60 


How can I access structure fields by name at runtime? 


Build a table of names and offsets using the offsetof( ) macro. 


#62 


I can’t seem to define a linked list node which contains a pointer to 
itself. 


Structs in C can certainly contain pointers to themselves; the 
discussion and example in section 6.5 of K&R make this clear. 
Problems arise if an attempt is made to define (and use) a typedef 
in the midst of such a declaration; avoid this. 


Copyright © 1992 Steve Summit 
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Struggling With Microsoft or Borland C? 
Shape Up With Our Easy3 Step Solution = 


nee meee rem me 


LEARM 4 THROUGH EKEMEISES 


START WITH THE WAITE 

GROUP’S NEW WORKOUT C. 

This balanced training program 
covers all topics and concepts 
of C programming, from simple 
to complex. Throughout more 
than 800 pages of illuminating 
text, hundreds of hands-on 
exercises illustrate everything 


from arrays to unions and bit fields. 


Workout C includes special 
versions of our Power C compiler 
and split-screen text editor, so you 
can compile and execute the 
example programs at the press 
of a key. There is no substitute for 
practice, and you get plenty of it 
with useful and interesting pro- 
grams such as a calculator, card 
playing game, file dumping utility, 
and animated graphics. 

By the time you finish 
Workout C, you’ll have a very solid 
foundation on which to build a 
successful C programming career. 






Order Now and Save $30 Off List Price 





J Please Send FREE Brochures 
Disk Size:0 5.25" 03.5" 
| (Requires DOS 2.0 or higher) 


| Name 

| Company 
| Street 

| City 

| State 
Country 

| Telephone 


Zip 


The High-Performance C Compiler 





AFTER PUMPING UP WITH WORKOUT C, 
YOU'LL BE READY TO DO SOME HEAVY 
DUTY PROGRAMMING WITH POWER C. 
This ANSI standard C compiler is 
now used by over 100,000 program- 
mers worldwide. PC magazine calls 
it “a heavy-weight contender at a 
bantom-weight price,” and Power C 
is a three time winner of Computer 
Shopper’s Best Buy award. 

Even though it’s powerful, 
Power C is very easy to use. That’s 
why The Waite Group chose it over 
all other compilers for Workout C. 
And unlike some compilers that hog 
30 or 40 megabytes of disk space, 
Power C requires only | megabyte. 
FEATURES INCLUDE: 

Over 450 Library Functions 

Mf Small, Medium, & Large Models 
Mf Mixed model with near/far/huge 
Mf Supports 8088/286/386/486 
Supports 8087/287/387 

WM CGA, EGA,VGA, Hercules Graphics 
Mf 650 page manual 


J 


XO) Workout Pack .......297. $49.95 
Includes: & WorkeutC 

Mf Power C*™ 

Mf Power Ctrace 
Shipping & Handling 
($5 USA, $10 Canada, $40 Foreign) 
Sales Tax (Texas Only) ............ 
Total Amount of Order............. 
Paying By: J Check or Money Order 
LY Visa LJ MC LJ Amex LJ Discover 
Card# 
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AS YOU DEVELOP YOUR OWN C 
APPLICATIONS, POWER CTRACE PLAYS 
A CRUCIAL ROLE IN THE PROCESS. 
This state-of-the-art debugger helps 
find your programming errors fast. 
Describing Power Ctrace in a word, 
Computer Shopper simply calls 
it “magnificent.” 
No other debugger shows 
you the internal working of your 
program the way Power Ctrace 
does. Watch your C source code 
executing not just line by line, 
but statement by statement. While 
viewing source code in one window, 
simultaneously view variable values 
and program output in other windows. 
Break and watch points stop exe- 
cution at suspected trouble spots. 
Like Power C, Power Ctrace 
is very simple to use. Combined 
with Workout C, this trio of pro- 
ducts will end your struggles and 
put you in great C programming 
shape. 


60 DAY 
MONEY-BACK 
GUARANTEE 
To Order Please Call: 


1-800-333-0330 


For Technical Questions: 
Tel: 1-214-783-6001 
Fax: 1-214-783-1404 


Mix Software 

1132 Commerce Dr. 
Richardson, TX 
75081 
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Microsoft has announced the comple- 
tion of the first version of the SDK for 
the Messaging API (MAPI). MAPI is a 
set of messaging function calls that 
allow developers to create message- 
enabled applications. The SDK contains 
the Windows operating system DLLs that 
implement the MAPI calls described in 
the finalized portions of the specifica- 
tion. Microsoft will make MAPI avail- 
able on four platforms: Windows, DOS, 
Macintosh, and OS/2, allowing the same 
basic messaging functionality across the 
board. 

Microsoft is making the specification 
for MAPI available to all interested. 
Some areas of the specification are still 
open for developer comment. The ar- 
eas still under review are explicitly 
called out in the specification. Devel- 
opers interested in obtaining the MAPI 
specification can contact Microsoft De- 
veloper Services at 800-227-4679. Read- 
er service no. 20. 

Microsoft Corp. 

One Microsoft Way 
Redmond, WA 98052-6399 
206-882-8080 


Introl’s C compiler line now includes 
support for Motorola’s 68HC16 micro- 
processor. The 68HC16 supports multi- 
ple memory models, making it more 
useful in complicated embedded sys- 
tems than the 68HC11. This latest of- 
fering, coupled with Introl’s C compil- 
ers for the 68HC11 and 68332, amounts 
to a complete set of development tools 
that include the compilers, source-lev- 
el debuggers, target-specific relocating 
assembler, stand-alone library, and sup- 
port utilities. 

Introl C is available for PCs, Macin- 
toshes, and workstations. Prices begin 
at $2000 and vary according to platform. 
Reader service no. 21. 

Introl Corp. 

9220 W. Howard Ave. 
Milwaukee, WI 53228 
414-327-7171 
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The ISIcon programming language and 
ISIcon/SI screen-interface development 
system are two new UNIX products 
from Iconic Software. [SIcon 1.0 is the 
first commercial release of the Icon pro- 
gramming language for UNIX/386 plat- 
forms, with new features designed to 
support a production software devel- 
opment environment. The Screen In- 
terface (SI development system com- 
bines ISIcon with the UNIX industry 
standard character-terminal optimiza- 
tion package, the extended terminal in- 
terface (ETI). This system supports the 
rapid development of high-perfor- 
mance, terminal-independent, full- 
screen user interfaces with ETI charac- 
ter-terminal text windows, menus, and 
forms. Front ends and interfaces de- 
veloped with SI will operate on devices 
ranging from character terminals to 
graphical workstations running termi- 
nal emulators. SI allows you to both 
prototype new interfaces and develop 
transportable front ends to C-language 
applications. 

ISIcon and ISIcon/SI are both avail- 
able for 386- and 486-based PCs run- 
ning most versions of UNIX. List 
prices are $395 for ISIcon and $695 
for ISIcon/SI. Reader service no. 22. 
Iconic Software Inc. _ 

P.O. Box 3097 
Lisle, IL 60532 
800-621-4266 


Magna Carta Software has announced 
the C Communications Toolkit/Extended 
DOS, a C-language developer’s library 
for serial and fax communications. The 
toolkit provides interrupt-driven serial 
communications at speeds of up to 
115,200 bps using standard PC hard- 
ware. For file transfer, you can choose 
from XModem, YModem, ZModem, and 
Kermit protocols. The package supports 
Hayes-compatible modems, emulation 
of VT100, VT52, and ANSI terminals, 
and fax communications. 

For 286 protected mode, the toolkit 
supports Borland C++ and Microsoft C 
using the Phar Lap 286|DOS-Exten- 
der. Applications developed with ei- 
ther compiler can use up to 16 Mbytes 
of memory and use 286 protected- 
mode instructions. For 386 protected 
mode, the toolkit supports the Intel C 
Code Builder 386/486; MetaWare High 
C with Phar Lap 386! DOS-Extender; 
and Watcom C/386 with 386 DOS ex- 
tenders from Rational Systems, Phar 
Lap, or Intel. 

The price is $299.95 and includes 
source code. Reader service no. 23. 
Magna Carta Software 
P.O. Box 475594 
Garland, TX 75047-5594 
214-226-6909 


Coherent 4.0 is a 32-bit version of Mark 
Williams’ UNIX-compatible operating 
system. New to the version is the abil- 
ity to run COFF binaries that run on oth- 
er PC UNIX systems (such as SCO UNIX 
System V/386 3.2.2), allowing users ac- 
cess to software libraries from other PC 
UNIX vendors. 

The distribution consists of six flop- 
py disks, and takes up less than 10 
Mbytes of hard-disk space; the kernel 
is around 100K. Coherent 4.0’s devel- 
opment tools include an optimizing C 
compiler, optimizing linker, and ver- 
sions of lex, yacc, awk, make, termcap, 
terminfo, and curses. A new 386 macro 
assembler is included, as is support for 
conditional assembly, listings, and 
assembly time variables. Text process- 
ing is done with nroff and troff; UUCP, 
ckermit, and kermit handle communica- 
tions. The administrative commands in- 
clude archiving utilities, a Bourne shell, 
a Korn shell, System V-style cron, vir- 
tual console support, and online man 
pages. 

Coherent 4.0 costs $99.95. Reader ser- 
vice no. 24. 

Mark Williams Co. 
60 Revere Drive 
Northbrook, IL 60062 
708-291-6700 


TeraTech announced that they have 
begun shipping Dazzle/VB, an image 
library for Visual Basic. In addition to 
allowing the display of existing images, 
Dazzle lets you draw lines, points, box- 
es, and circles in both filled and out- 
line versions, using any of 256 colors. 
Dazzle also has text-drawing routines 
which can use any font and point size. 
Thirty different screen wipes allow ef- 
fects such as closing curtains, explo- 
sions, implosions, and more. You can 
also write your own wipes using the 
high-speed, partial image-block copy 
routines in Dazzle’s DLL. 

Dazzle Professional has all the same 
features as Dazzle/VB, plus: grey con- 
version and display, full-color negative, 
image compression, color-image en- 
hancement, and extended palette con- 
trol. 

The list price of Dazzle is $299; Daz- 
zle Professional costs $499. Reader ser- 
vice no. 25. 

TeraTech 

3 Choke Cherry Road, Suite 360 
Rockville, MD 20850 
800-447-9120 


Now shipping from Compiler Resources 
is Yacc++, an object-oriented rewrite of 
lex and yacc. Yacct++ creates classes of 
lexer and parser objects that act as call- 
back coroutines within eventdriven ap- 
plications such as found on Windows, 
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THE CHALLENGE BEGINS HERE... 





As a software developer you face 
a daunting task every time you write a 
line of code. You must make your 
software powerful, 
easy-to-use, and 
visually ap- pealing for 
all users of your product from 
the novice to the veteran. Your chal- 
lenge begins with the installation step 
where your customers get their first 
impression of your product. If you 
have a customer that is a power 
user, then that customer 
expects to have full 
control over the installa- 
tion process (drive, 
directory, video, 
configuration, etc). Novices expect to 
be guided to the proper installation 
setup without getting a major head- 
ache. 


INSTALL IS THE SOLUTION 













"This is one toolkit | wish | had 


discovered years ago." 
-- Tom Swan, PC World 


INSTALL is an executable file 
that can automate the installation of 
any product distributed on magnetic or 
CD-ROM media. INSTALL allows you 
to specify the precise installation 
procedure for your product(s). IN- 
STALL provides many benefits to you, 
the software developer, including 
reduced disk costs, automated 
distribution disk 
building, sensing of the 
target computer's 
environment, and the 
deinstallation/updating of 
prior versions. 














"(INSTALL) shows an attention to 
detail that is all too offen missing 
even in the most expensive, 
polished programs. From the 
software's flawless operation to 
the manual’s exemplary layout, 
Knowledge Dynamics has done 
an excellent job." 


— Larry O’Brien, Editor, Computer 
Language Magazine 


INSTALL Pro gives you the best of 


= both worlds, script or no script... 


INSTALL Pro is configured using a script 
file (which is a powerful, easy-to-learn 
"installation language") to allow you to 
have complete control over the installa- 
tion process. Yet, INSTALL Pro also 
includes an automated 
"installation generator" that 
eliminates the need for script 
file programming if you don’t 
need the power of a script 
language. 


NEW VERSION! We are 
now shipping INSTALL and INSTALL 
Pro Versions 3.2 which feature data 
compression that is approximately 40% 
higher than Version 3.1. We still include 
full C source code (rarely needed), and a 
royalty-free license (just like a compiler), 
and we offer a 30-day unconditional 
money-back guarantee. 

If you release updates on a regular 
basis (who doesn’t?) then INSTALL Pro 
will help you manage updates as 
well as the initial installation 
phase. 

INSTALL is even 
available in an international 
version that supports non- 
English-language installations. 
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INSTALL is available for OS/2 1.X/ 
2.0, MS-DOS 2.X and above, and Windows 
3.0/3.1 (coming soon). 


INSTALL is the premium installation 
program on the market when judged by 
such criteria as INSTALL.EXE file size, 
hardware detection (CPU, video, etc), 
reliability, RAM requirements, network 
support, installed base, and technical 
support. You can get another installation 
program for less, but if you have thousands 
of copies of your product "out there" and 
you have a problem, wouldn’t you rather 
have a company backing you that has been 
selling installation programs since 1988? 
All we sell are installation programs and 
INSTALL reflects our commitment to be the 
best. And we are the only installation 
program to win a Computer Language 
Programmer Productivity award. 


"Now you can have an install 
program with the same quality as 
Borland, Microsoft and others. 


Rating 99/100." 


— Atlanta Database Users Society 


Call us toll free to discuss your installation 
requirements or to order INSTALL. 


$249.95 INSTALL Standard 
$399.95 INSTALL Professional 
$349.95 INSTALL International 
$299.95 INSTALL 0S/2 
$399.95 INSTALL Dual Mode 
Other versions available 


1-800-331-2783 Sales 
1-512-964-3994 International 
1-512-964-3929 BBS 
1-512-964-3958 FAX 








KNOWLEDGE 
DYNAMICS 
CORPORATION 
P.O. Box 1558, CANYON LAKE, TX 78130 (USA) 


1-800-331-2783 x191 
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(continued from page 170) 
Presentation Manager, OpenLook, and 
Motif. The lexer and parser objects are 
reentrant and can be active concurrent- 
ly. They are created from grammars, 
which can be modularized and reused, 
owing to multiple inheritance. The 
grammar classes can be dynamically 
bound to lexer and parser objects. 

Yacc++ directly translates regular ex- 
pressions and produces minimal state 
LR(1) lexers and parsers. It comes with 
the Language Objects Library, a class 
library intended for managing lexer and 
parser objects and developing language 
processors. 

Single-user professional versions cost 
$495 for DOS or Windows, $695 for 
OS/2 or PC UNIX, and $995 for SPARC 
or Sun workstations. Source code in- 
cluded. Reader service no. 20. 
Compiler Resources Inc. 

3 Proctor Street 
Hopkinton, MA 01748 
508-435-5016 


M++ QUAD is a C++ numerical inte- 
gration package from Dyad. The pack- 
age works with Dyad’s M++ Scientific 
to allow integration of one- or two-di- 
mensional functions over finite or infi- 
nite limits. The functions can be con- 
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Ph. (414) 231-3333 
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Network- 
Systems 
Design, Inc. 


P.O. Box 379, 2908 Fond du Lac Rd., Oshkosh, WI 54902 
Fax (414) 233-8699 


FlatFunc is a trademark of Network-Systems Design, Inc. 
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tinuous, discontinuous, or infinite. The 
user can select the integration rule, the 
end points, the number of points, and 
the error criteria. 

The integration procedure is based 
upon the adaptive Gauss-Konrod quad- 
rature rules, but nonadaptive rules can 
also be selected. QUAD++ offers a set 
of special-purpose software modules de- 
signed for advanced mathematical op- 
erations, as well as modules for opti- 
mization, statistical utilities, testing, and 
least squares. 

QUAD++ costs $195 for DOS and 
$245 for UNIX and requires M++ Sci- 
entific. Reader service no. 27. 

Dyad Software Corp. 
515 116th Ave. NE, Suite 120 
Bellevue, WA 98004 
800-366-1573 or 206-637-9426 


FlashTek has released two new 32-bit 
DOS extenders: X-32 and X-32VM. 
X-32 (an evolution of the DOSX DOS 
extender included with Zortech C/C++) 
now includes: a debugger interface; 
spawn(), exec(), and system( functions; 
and extender function calls. X-32 is com- 
pact, with executables either slightly 
larger or smaller than equivalent large- 
model 16-bit programs, depending on 
program size. 


WHEN THE 
DOCTOR TALKS... (« 
PEOPLE LISTEN 


X-32VM combines all the features of 
X-32 with up to 3.5 gigabytes of virtu- 
al memory. The package for the Zortech 
compiler sells for $69.00 and includes 
both X-32 and X-32VM. Reader service 
no. 28. 

FlashTek Inc. 

804 Airport Way, Suite D 
Sandpoint, ID 83864 
208-263-7311 


Support for OS/2 2.0 and new 486 op- 
timization has been included in version 
9.0 of the Watcom 32-bit compilers. The 
compilers are supported by various add- 
ons such as libraries for graphics and 
communications and Windows devel- 
opment tools. The tools will support de- 
velopment of 32-bit applications for OS/2 
2.0, and use of OS/2 2.0 as a host sys- 
tem enabling 32-bit cross development 
for a large set of target 32-bit environ- 
ments, including Windows and DOS. 
The new version costs $895. Upgrades 
are $99. Reader service no. 29. 
Watcom 
415 Phillip Street 
Waterloo, Ontario 
Canada N2L 3X2 
800-205-4555 


DDJ 


If the editors of Dr. Dobb's Journal 
give your product a clean bill of health, 
shouldn't everyone know about it? 


2K OK 2 


We can reprint the Doctor's diagnosis 





of your product on 70 Ib. paper stock 
in 4 color, 2 color, or black and white. 


Call Laura Stack Pullen 
415-358-9500 
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BOOKS 


The Little Black Book of 


Ever buy a book that purports to teach 
you about viruses and end up with a 
mouth full of gravel? The Little Black 
Book of Computer Viruses will give 
you the SOLID MEAT you need to 
completely understand computer 
viruses. Includes over 60 pages of fine- 
print assembly uage examples for 

y Cimating book for the 


book has bees Sup “rb tutorial on programming!” 
“Dr. Lud is to be congradulated. I have re-read 
the bak a Gouple of times already.” 


> Ang 


the in(Oné night. It is amazing... this 
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BROWSERS 


SEEING IS BELIEVING WITH 


DocBROWSER 


DocBROWSER is a complete documentation system 
for C and PL/M programs, with printed reports 
on size, callers, users, declarations and more plus 
hot-key access to full featured browser interface 
for all sourse and include files, functions, data 
structures and comments. 

See what one keystroke can do for YOU! 


1 800 927 1481 
Yee 


Mission Viejo, CA 92690 
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CC-Rider 
C++ Rider 


Pop-up C & C++ browsing and 
analysis with cross referencing in 
your own editor! 


The original and the best! 
Western Wares | FREE DEMO | 
(303) 327-4898 
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CD-ROM 





Source Code CDROM $39.95 
600 mB of C, Pascal, Asm. for Unix and DOS. 


GNU/X11R5 CDROM $39.95 
Entire GNU and X11R5 source code. 


MS Windows CD ROM ~— $24.95 


Hundreds of MicroSoft Windows programs. 
Many Other titles available. 


Walnut Creek CDROM 
+1 800 786-9907 eed 


+1 510 947-5996 
+1510 947-1644 FAX 
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¢ Optimizing C compiler with 
assembler for 6805 family 

¢ ROMable code 

¢ Emulator support 

¢ Development shell & editor 

¢ Linker & librarian 


Byte Craft Limited 
421 King Street N., Waterloo, Ont. N2J 4E4 


(519)888-6911 
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FS: PASCAL 


Compile your Pascal Program for 32-bit 
80386 protected mode. Forge about memory 
limitations. Gain speed. Fully compatible with 
TROD. 
Royalty-free DOS extender. Over 250 new 
functions. Starting with $150. 30-day money 
back guarantee. 
To order call: 1-800-934-3732 
1-718-520-4197 
Fax: 1-718-575-8038 


Frontier Software Development Corp. 
68-30 Harrow St., Forest Hills, New York 11375 
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¢ Optimizing C compiler with 
assembler for Zilog Z8 family 

¢ ROMable code 

e Emulator support 

¢ Development shell & editor 


Byte Craft Limited 
421 King Street N., Waterloo, Ont. N2J 4E4 


(519)888-6911 
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Database Library 


SoftC is the database library of choice for 

professional C & C++ developers. It offers 

unsurpassed speed, flexibility and portability along 

with access to industry standard files at a very 

competitive price. 

eFull compatibility with dBASE, Clipper, Alpha IV, 
FoxBASE+ & FoxPro. 


ePortable to DOS, OS/2, UNIX & XENIX. 


elncluded Windows DLL is linkable with most 
DLL-capable compilers (including Visual Basic). 


eSingle/multi-user & network access supported. 
¢100% C source code (both ANSI and K&R). 


eNo Royalties. Version 3.2 now shipping. 


30 day money 
back guarantee. 


Ae 


16820 Third Street Northeast 
Sane Anoka, MN 55304-4703 U.S.A. 


m Fax (612) 434-5023 





Complete Package 
only: 


(612) 434-6968 


New! 
1 Xe). 4 me 40 
A ODD. S BITE 
Support! 


$195 
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BASIC TOC 


BAS_C v5.4 accepts BASICA, Quick BASIC 
v4.5, or other different BASICs by adding new 
statements, functions (Customizer), generates 
structured, indented, scoped C, MS/Turbo/ 
Lattice/Aztec C. All memory model, compile 
option are supported. C source code (Runtime 
Library) is included. Run on MSDOS, XENIX, 
UNIX. 95% conversion ratio. Demo disk. 
From $375. 
GOTOLESS COVERSIONS 
P. O. Box 835910 
Richardson, TX 75083 


(214) 625-2323 
BBS: (214) 625-6905 FAX: (214) 370-2612 





DEVELOPMENT TOOLS 


On-line help & tutorials 
Quality & usability testing 
Custom libraries & utilities 


installation & upgrade programs 


uct to market faster - call us tod: 
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Formerly PforCe by Phoenix. 
Includes: 


¢ Screen Management 


















e B-tree, and more 


— ao ¢ Windowing 
eo ¢ Com 
= — e Database 

a oe ¢ Source Code 

c> 


Programming Modules International 
Box 8402 Green Bay WI 54308 
1-414-468-6040 1-414-465-0464 FAX 
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e Dos and Windows 

@ 2 Gig virtual array supporting insert, delete, 
automatic use of EMS and/or Disk 

e Balanced multiway trees for disk or 
memory 

@ List processing, LZH compression, Sorting 

@ Memory manager detects overwrites, unre- 
leased memory, uses fewer selectors 

| Only $89 w/source - Check/MO/COD 

Specify C++ or C, disk size 

DKS Software 


1201 Lakewood, Allen, Texas 75002 
call/fax (214) 727-6825 
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MS Windows 
¢3D Chart °¢ Table ¢ Status Bar 
¢ Toolbox -¢Ribbon « Field Validation 


ToolsKan set: 3D Chart, Table, Status Bar, Toolbox, 
Ribbon and Field Validation, which are 
Windows custom controls. Applications 
can be quickly & easily built by using the 
ToolsKan development tools. They are 
very easy to use and save you a lot of 
development time. Source codes and 
DLLs are included. 

With Source. NO Royalties. 30-Day Money Back Guarantee. 


Kansmen Corporation 
. Tel: (408) 988-0634 Fax: (408) 988-0639 
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C++/C PROBLEM SOLVER TOOLCHEST | 





Alloc-Ge 
CODEWRIGHT’S TOOLWORKS 


* auto reclaims memory 


A Garace Coutecting * ‘Places malloc()/free() 
MALLOC() ¢ no dangling pointers 






* no memory leaks 
° just relink 


P.O. Box 990 
San Pedro CA 90733-0990 
Voice: 310-514-3151 
Fax: 310-514-3151 


$30 Personal no source 

$50 Personal with source 
$200 Commercial (with source 
$5 Shipping/Handling 
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Now you don't need an ODBMS to implement persistent 
data objects thanks to the Loose Data Binder (LDB 1. 
C++). LDB provides for strong type checking w/ or w/o C++ 
templates and includes an unconventional persistent con 
tainer class having a hybrid stack, queue, deque, list, arra 
interface and built-in sort, search, and iterator functions in 
one flat class. Use LDB to implement algorithms that co: 
ventional “textbook” container libraries can’t touch. An LDB 
network of containers/elements can be streamed using ont 
one insertion/extraction operation. Use LDB as a kernel 


friendiy license agreement. NG help, and source on both 
3.5” and §.25" DOS diskettes. See DDJ's April 92 “Of 
interest.” Scott Guthrey of Austin Code Works calls LDB “a 
sweet piece.” Only $60. To order call (703) 759-3838. PSW, 
P.O. Box 10072, McLean, VA 22102 8072, 
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DOCUMENTATION 








Save time and money! Total development cycle 
support including outline Cost Proposals, Entity 
Relationships, detailed Process Specs, and com- 
plete User Manuals with Index, 
Table of Contents, Getting 
Started, Reference, 
and much more! 
$149.95 
800-221-8275 
(407) 826-0348 





Task Software Co. 2856 Falling Tree Cir., Orlando FL 32837 : : CIRCLE NO 918 ON READER SERVICE CARD 


] LANGUAGES 
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EDUCATION 





B.Sc. & M.S. in COMPUTER SCIENCE 


The American Institute for Computer Sciences offers 
an in depth correspondence program to earn your 
Bachelor of Science and Master of Science degrees 
in Computer Science at home. BSc. subjects covered 
are: MS/DOS, BASIC, PASCAL, C, Data File 


| Processing, Data Structures & Operating systems. 


MS program includes subjects in Software 
Engineering and Artificial Intelligence. 


American Inst. for COMPUTER SCIENCES 
2101 Magnolia Avenue, Suite 200 
Birmingham, AL 35205 
(205) 323-6191 (800) 767-2427 
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¢ EMPLOYMENT. ) 





Programming Jobs Available 


Using your personal computer and 
favorite communication package dial: 
1-800-377-4769 300/1200/2400: 8/N/1 


¢ Review Nationwide Job Listings 

¢ Forward your Resume to Employers 

¢ Build an Effective Resume Online 

e Talk via E-mail to Potential Employers 


CAREER OPS 
A Division of DENEL Solution Systems 
1775 The Exchange, Suite 610 
Atlanta, GA 30339 
Phone (404) 951-0706 * FAX (404) 951-0731 



















_ |Animation. Mouse and joystick support. 








_ | tions, and represent you to the industry. If the venture 


GRAPHICS LIBRARIES 





IMAGE PROCESSING 


Victor Library for C programmers. Powerful image process- 
ing & color reduction, bright/contrast, sharpen, outline, 
resize, overlay, matrix conv., etc. TIFF/PCX/GIF/TGA/bin, use 
exten'd, expan’d, conv mem, 8-bit/24-bit images, any size, 
grays, color, EGA/VGA, up to 1024x768x256, LaserJet, 
ScanJet+, for MSC, QuickC, Turbo C/C++. Source avail, no royal. 
VICTOR LIBRARY, V2.2 $195 
CATENARY SYSTEMS 
470 BELLEVIEW ST LOUIS MO 63119 


VISA/MC CALL/FAX (314) 962-7833 cop 
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|B) == We) OF) BY O7:\\' ME) ww -V/ =, Om = -1e) pl bon k= 
ON TIME AND UNDER BUDGET 


With Building Block Software’s 
CAD/CAM Developer’s Kit (CCDK) Series 
2-D ¢ 3-D @ DXF @ NC-Milling ¢ NC-Turning 


The CCDK’s are libraries of C functions which support 
standard CAD/CAM features such as reading and writing 
DXF files, constructing, editing and graphically displaying 
2D and 3D CAD geometry and computing numerical 
control toolpaths. Kit prices start at $499. Source code, 
maintenance and site licenses are available. 


.371 Moody Street 

Waltham, MA 02154 
VOICE 617-899-4350 
FAX 617-899-4399 














: Building 
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SCIENTIFIC VISUALIZATION 
graphics library for VGA and ET4000 Super 
VGA boards. Over 75 routines callable from 
MS FORTRAN. Up to 1024x768 in 256 
colors. 2D'‘& 3D with perspective. Gouraud 
shaded polygons. Hidden surface removal. 


Store/retrieve in PCX format. $100 postpaid. 
MC/VISA accepted. Call for free demo disk. 
AEROSOFT CORPORATION 


5562 Bells Ferry Rd., Suite 233, Acworth, GA 30102 
TEL/FAX(404) 917-1309 





ASF Software 


Advanced Symbolic Form (ASF) 


ASF is an object-based, realtime, multitasking, 
extensible operating system and language. ASF 
has: only objects, no files virtual a ha of 4 
billion objects; 32-bit real memory addressing. 
Eliminates compilation and linking errors an 
delays. ASF can be targeted to several platforms. 
Free Demo Version available. 

ASF Software 

303 N. Rose St. Suite 315 


Kalamazoo, MI 49007 
(800) 832-4452 
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LEGAL SERVICES 





We will provide legal services to protect and license your 
software, create and act as legal counsel for your corpo- 
ration, provide business services to perform market 
analysis, prepare business plans and financing applica- 


fails you owe us nothing for our services. If it’s a success 


we all profit. 
Call or write today. No obligation. Confidential 


15910 Ventura Blvd., Suite 801A, Encino, CA 91436 
v: 818/379-8770 f: 818/379-8780 CIS: 71401,1653 
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LIBRARIES 


Object Oriented Technology for Engineering, 
Manufacturing, Finance, Education 
Meijin++° 2.0 
Royalty Free 
115+ Classes 


700 Page Manual 

Money Back Guarantee 
Document Source Code 
MS-DOS/Ws, UNIX, SUN 


CALL: 714-755-0995 


Network Integrated Services, Inc. 
Santa Ana, CA Fax: 714-433-2347 
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PGL - Printer Graphics Libraries 


The PGL ToolKit is a set of easy to use 
libraries for generating device independent, 
high resolution graphics output on most popular 
printers. Includes full support for C/C++, 
Fortran, Basic, Pascal, Clipper, & Assembly 


plus 32-bit support for C and Fortran. NO 
ROYALTIES! 


Only $195 (includes all language support) 
Source code available! 


AnSoft, Inc. Voice/FAX: (301) 470-2335 
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dBASE File Access 
From Visual Basic 


Sequiter Software introduces CodeBasic, the 
complete Visual Basic library for accessing and 
creating dBASE Ill/IV, FoxPro and Clipper compat- 
ible data, index and memo files. Use familiar BASE 
commands such as Skip, Go, Append and Pack. 
Distribute your programs royalty free. 


Sequiter Software Inc. 
Tel. (403) 437-2410 Fax (403) 436-2999 
CIRCLE NO. 923 ON READER SERVICE CARD 





Get Extra Memory for your C programs with E-MEM! 
Allocate MegaBytes of RAM! Transpar- 
ently uses either EMS or XMS. From 1k to 16Mb! 
Process your data in Extra Memory instead 
of disk. One customer reported an 18,000% speed gain! 
Process all your data at once, instead of jug- 
gling with what fits in conventional memory! 
E-MEM is just $99. Easy to use, money-back 
guarantee. Ask about our 12 other products for C, Basic 
& Visual Basic when you call: 


TeraTech 
Dept. 137, Suite 360, 3 Choke Cherry Rd., Rockville, 
MD 20850. (301)330-6764 Ext. 137 Fax (301)963-0436 
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C-HEAP More than just another 
memory management library! Over 
600 assembly functions give your 
MSC/Borld C/C++ program unpar- 
alleled capabilities in 
DOS/XMS/EMS/virtual memory 
management. Call for info. 
Libraries Technologies 
800-767-4214 
608-274-4224 
fax 608-833-1171 
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Use a Neural Network in your next 
application. NNetLib is an object code 
library of neural network modules. 5 
models are included: LVQ, SOM, 
BackProp, CounterProp and the Hopftiled 
Net. Turbo C 2.0, MicroSoft C and Turbo 
C++3.0 are supported. $169 


LEVER SOFTWARE SYSTEMS 
Call 1-800 NeuralO (638-7250) 
For a FREE Promo Disk 
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MARKETING 









Launch Your Product 
Do you know all you need to know about getting 
your software product to market? Could you use 
the help of a marketing professional with over 10 
year’s and hundred’s of product launches of 
experience, but can’t afford it? 


Hire a marketing pro for $87 a year! 
The Inside Trac, a bimonthly interactive newslet- 
ter from The Trachtman Group, help companies 
like yours navigate through the maze of issues 
confronting your product launch. 


Promotion Distribution, Sales, Funding, Startup 
Expenses, Support, Pricing & More. 

Call 800/659-7420 or Fax this Ad to 404/455- 

8449 for your first issue free ! No obligation. 


NETWORK 


















Try the Ist truly low cost LAN 
* Connect 2 or 3 PCs, Xts, ATs, PS/2s, 386s 
«x Uses serial ports and 5 wire null modem cable 
« Runs at 115K baud, up to 90 feet 
+ Runs in background, totally transparent 


« Share any device, any file, any time 
« Needs only 14K of ram 
* Version 2.31 * Over 15,000 SOLD 


Skeptical? We make believers! 


Information Modes 

P.O. Drawer F, Denton, TX 76202 
817-387-3339 Technical, 7 days 
1-800 628-7992 Orders 
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RECRUITMENT 


SEEKING PRODUCT 
DEVELOPMENT SPECIALIST 


World‘s #1 Computer 
Book Publisher 











¢ Creative 
¢ Knowledge of C, C++, Pascal 
¢ Experience with MS-DOS, OS/2, Unix 
¢ Technical documentation experience 
Send resume referencing “PDS” to Tina Rodgers, 
Prentice Hall Computer Publishing, 
P.O. Box 90-A, Carmel, IN 46032 
EOE M/F/H/V 
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*Software Jobs Overseas* 


The International Computer 
Professional Association provides 
you with the world-wide contacts and 


up-to-the-moment information you 
need to find an exiting software 
assignment overseas . For FREE 
details on joining the ICPA. 


Call (415) 695-7618 - 24 hours/day 































We specialize in 
MSWindows, Mac 
and OS/2 recruiting. 
Our clients include 
many of the nation’s 
top names in GUI 
based software 
development. So 
give us a call... we 
can help open your 
window to success! 
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EXCELLENT OPPORTUNITIES 
NATIONWIDE 
Software Engineers; Programmers, 
System or Database Developers 
and Analysts 
Fees are employer paid. Call or send 
your resume to: 
Gary Spann 
Ability Search Group 
202 N. Midvale Blvd, Madison WI 


53705 
(608) 231-2421 FAX (608) 231-4453 
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SECURITY 


Since 1986, thousands of companies worldwide 
have chosen Az-Tech Software as partners in 
their fight against Software Pirates. Why? 
Because we offer you proven leaders: 

EVERLOCK 3.0 SOFTWARE COPY PROTECTION 
Option Board Safe—Remote Registration 

New CPU LOCK-CD ROM LOCK and more 


EVERKEY HARDWARE LOCKS 





Az-Tech Software, Inc. 
201 East Franklin, Richmond, MO 64085 
(816) 776-2700 Fax (816) 776-8398 
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TOOLS 


! AUTOMATED DOCUMENTATION! 

* C-CALL ($69) Graphic-tree of caller/called 
hierarchy, files-vs-function table of contents 

* C-CMT ($69) Creates/inserts comment-blocks 
(functions/identifiers used) for each function 

* C-METRIC ($59) Path complexity, and counts 
lines/statements/comments for each function 

* C-LIST ($69) List and action-diagram, or 
reformat source into standardized formats 

* C REF ($59) Local/global/define/parameter 
cross-reference, C++ class hierarchy tree 


* C-DOC ($199) All 5 as 1 DOS program 
* Professional ($299) DOS/OS2/Windows 


SOFTWARE BLACKSMITHS INC. 
6064 St Ives Way, Mississauga, ONT 
Canada L5N-4M1 (416) 858-4466 
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Why you want BATCOM! 


BATCOM is a batch file compiler that compiles 
your “.bat” files to “.exe” files to make them 
faster, more professional, and more capable. 
BATCOM extend DOS with new commands 
so you can read keyboard input, perform 
arithmetic, use subroutines, and much more. 
In addition, BATCOM protects your source 
code, and you can distribute your compiled 
programs with royalties. For IBM PC. Only 
$59.95. Order today! 

/\. Wenham Software Company 

ft e |5 Burley St. 

\_ 7 Wenham, Ma. 01984 


(508) 774-7036 
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SWAINE'S FLAMES 


Thought Experiments 


Assume that the relationship between citizen dissatisfaction and participation in the electoral 
process can be described by a curvilinear function: mild to strong levels of dissatisfaction 
lead to increases in participation, but strong to extreme levels lead to disgust with the 
process and decreased participation. Under this assumption, is it more strategic for a President 
already unpopular among African Americans to attempt to build bridges or to hire the 
(former) Los Angeles Police Chief as an advisor in the hope that people who wouldn’t vote 
for him in any case will be disgusted and stay home? 

You are in a burning building with a teenager and a test tube containing a fertilized human egg. 
The teenager’s leg was broken by the heavy beam that is pinning her to the floor, but she is 
conscious and it is just possible that she could push off the beam and crawl to safety before the 
roof falls in, without your help. The egg, on the other hand, can’t move unaided no matter how 
hard it tries. Just outside are a mobile hospital unit, a team of obstetricians, and a willing surrogate 
mother, all ready to give the egg every chance to grow up to be President of the United States. 
Also, a boy scout with a tourniquet to set that leg. You have time to save only one, the teenager or 
the egg. State which and be prepared to justify your answer in court. 

You are an association of software publishers. The NSA offers you a deal, involving expedited 
export approval for some software, getting around the holdups that the State department imposes 
on software that includes encryption capabilities. All the NSA asks of you is that you not tell 
anyone just what the deal is. What do you do? 

These being thought experiments rather than questions on a test, there are no right answers, but 
there are plenty of wrong ones. More on political participation: If politics is the art of getting 
elected, the problem of the homeless, politically speaking, is the nuisance of sidewalk traffic 
disruption; the disenfranchised are not a constituency. Congressional candidate Glenn Tenney 
(415-574-3420, voice or fax) is my source on governmental efforts to keep e-mail crackable. Aside 
to Nelson Richardson in New York: I can’t answer your question about deconstruction and 
mathematics, but I’m off to France next week to research it. 

And now, here are the answers to June’s quiz.... 





Acronym Event 

RTFM (Universally ignored advice.) Read the flaming manual. (You see this a lot online.) 

RTFS (Same advice, but for engineers.) Read the flaming spec. 

RTFB (Advice for novice users of Macintosh System 7.) Read the flaming balloon. (System 7 has 
“help balloons” for novices.) 

RWFM? TFRM, TFUM, OTFG-SM? (Choices, choices.) Read which flaming manual? The flaming 
reference manual, the flaming user manual, or the flaming getting-started manual? 

IAEF, RTFM (Last-resort advice.) If all else fails, read the flaming manual. (The full version of the 
original acronym.) YWTFS/WRSN&IWTFMASAP (Said the documenter to the developer.) You write 
the flaming software real soon now, and I'll write the flaming manual as soon as possible. 
IRTFM&ISCUTFS/W (Why the advice is universally ignored.) I read the flaming manual and I still 
can’t use the flaming software. (““U” can also mean “understand.”) 

ICRTFM, TIWWTFS/WWTFM2 (Why programmers should program.) I can’t read the flaming 
manual; the one who wrote the flaming software wrote the flaming manual, too. (For hardware, 
the acronym might add, &TIFTT, “and translated it from the Taiwanese.”) 

RWFM? TJOS, TKR, OTKS? Read which flaming manual? The Joy of Sex, the Kinsey Report, or the 
Kama Sutra? (I don’t understand this one.) 


Anagram Event 
Real friend or nuts?=Userland Frontier. Magic land of lost tribes= Bill Gates and Microsoft. 
Comment in jest by MIS; grad gets poorer= Object-oriented programming systems. 


Bonus Question 
Why is Christmas the same as Halloween? Because Dec(imal) 25=Oct(al) 31. 


HilaD Sicads 


Michael Swaine 
editor-at-large 
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With zApp You Can Spend More Time Doing What You Do Best. 


zApp 2.0 isa C++ Application Framework designed to free you from the complex details of 
programming in Windows, OS/2 PM, DOS and UNIX X/MOTIF. With zApp you can now 
cut development time and code size by up to 80%! Leverage your development. Why program 
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SOME OF ZAPP’S 
ADVANCED 
FEATURES INCLUDE: 


Simplified Window 
Definition Create a 
window in one line of 
code. 


Unified Display 
Architecture (Device 
Context) Use the same 
drawing code for windows, 
printers, bitmaps, meta- 
files and OLE. 

Fixed Text Pane Printf 
to a window without wor- 
rying about repainting. 
Logical Size and 
Dimensioning Objects 
Automatically resize objects 
based on display metrics. 





ment experience. Rise above the chaos and join the new order. 

Call us at 1-800-3-INMARK to join the thousands worldwide 
experiencing the benefits of zApp. ‘ty zApp risk free for 
60 days, and if at any time during that period you re not 
satisfied, just return it fora full unconditional refund. 


Frame /Pane 
Architecture Separates 
frame windows, with appli- 
cation menus, from pane 
windows, with display 
code. Pane windows may 
be easily attached to differ- 
ent frames which greatly 
increases code reuse and 
simplifies maintenance. 


Automatic Window 


Sizing and Positioning 
Sizer objects easily specify 
how a window is resized 
in relation to its parent, 
greatly simplifying complex 
screen design. 

Data Entry Forms 
Comprehensive input val- 
idation for simplified data 
entry dialog box creation. 


for only one environment when you can have them all? zApp gives you single source code 


compatibility so you can run your application on all the 
above platforms, just by recompiling. Have all the olyects 
you'll ever need. ZApp provides you with over 130 ready 
made object classes, encapsulating all the facilities you 
need to quickly create sophisticated applications, from 
Windows, Dialogs, Controls, Menus, Fonts, Graphics, Bitmaps, MDI, and 
Printing, to Memory Management, OLE, DDE, Object Persistence, and Forms. 
Get up to speed (in hours, not months). zApp comes with comprehensive 
tutorials and over 35 sample programs, many based on examples from Petzold’s 
“Programming Windows’, and others that show you how to use facilities like 
3_D toolbars and status lines. To keep you programming at your peak, our 
technical support is provided by real programmers with application develop- 


Flexible Message 
Handling Messages can 
be dynamically dispatched 
to any object at compile 
time or runtime. 


Transparent MDI 
Support Instantly convert 
single document applica- 
tions to MDI. Menu 
switching and other 
complex details are han- 
dled automatically. 


Controls Automatically 
superclassed at runtime, 
including third party 
custom controls. 
Automatic Optimized 
Memory Management 
System Hundreds of 
thousands of objects can 
be allocated. 













Advanced j 
Printing Support / 
Includes banding, ¢ 
status dialogs, and 
printer configuration. 
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Pricing 


zApp for: 
Windows $195 


Windows 
w/Soure $295 


DOS & Windows 
w/Soure $495 


OS/2 & Windows 
w/Souree $695 


UNIX X | MOTIF 
w/Source 4thQ’92 





C++ 
Application 
Framework 





zApp requires no runtime royalties, is available with source and is written in standard C++. zApp supports Borland, Microsoft, Zortech, and other major C++ compilers. 
Download the zApp Information Program either from our BBS, or fom the Inmark conference on BIX; or send us a request on CompuServe. This demo is an 
MDI application written in zApp, which lets you examine zApp’s classes, and browse the source code of several sample programs. Corporate training is available. 


oe INMARK 2065 LANDINGS DRIVE, MOUNTAINVIEW, CA 94043 PHONE 415-691-9000 FAX 415-691-9099 BBS 415-691-9990 
COMPUSERVE ID 70550,2570 IN EUROPE CONTACT: GREY MATTER (UK) +44 (0)364 53499 FAX +44 (0)364 53071 
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to C++ Borland has 
youre looking for 






Borland C++ & Application full support for multimedia, TrueType, pen and start with, yet powerful enough to stay with. $99” 
Frameworks 3.1 sets the standard OLE. Now in its third generation, Borland C++ ; 
a ie ++ ; 

For professional developers looking for the & Application Frameworks is the award-winning bab Baldi 3.1 
best development tools for C and C+* in choice of programmers worldwide. $749 Wind y nae 
Windows and DOS, Borland® C++ and Borland C++ 3.1 . un ows programms: 
Application Frameworks has it all. Application eee Designed exclusively for Windows, Turbo C++ 
Franieworks dramatically reduce your develop- The professional C and C++ for Windows is the fastest way to program 
ment time. And the new version 3.1 gives you When you want the best in proven C and Windows applications. $149” 

, WINDOWS aa) C++ professional tools, but don’t need the ‘lneienianiasioniostentenienitenienies 


convenience of application frameworks, 


Borland C++ & ; 
Borland C++ is the right choice. $495 : See your dealer today or call | 


Professional . . = = = 
ania Borland C++ $495" Turbo C++ 3.0 is your starting j 1-800-331-0877, ext. 5255 
Turbo C++ Turbo C++ point for C and C++ H CALL NOW! | 
Entry level for Windows | for DOS Whether you're just learning C, or know C and " See below for free poster offer. , 


$149.95" $99.95* 
No matter what you need, Borland has a C++ for you. L a oe = 





want to move to C++, Turbo C++ is easy to 


BORLAND 


The Leader in Object-Oriented Programming 


A full-color poster of the above illustration is available for the asking ($4.50 shipping and handling). Supply is limited. Call 1-800-344-4394. *Suggested retail price. All prices are in U.S. dollars. Dealer prices may vary. 
Copyright © 1992 Borland International, Inc. All rights reserved. Borland C++ and Turbo C++ are trademarks of Borland International, Inc. Bl 1508D 
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